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state of at least one gene or associated regulatory region of the gene and identifying aberrant methylation of regions of the gene or 
^ regulatory region, wherein aberrant methylation is identified as being diflerent when compared to the same regions of the gene or 
associated regulatory region in a subject not having said cellular proliferative, thereby delecting a cellular proliferative disorder in 
the subject. 
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DIFFERENTIALLY METHYLATED SEQUENCES 
IN PANCREATIC CANCER 

FIELD OF THE INVENTION 

The present invention relates generally to the regulation of gene expression 
and more specifically to a method of determining the DNA methylation status of CpG 
sites in a given locus and correlating the methylation status with the presence of a cell 
proliferative disorder. 

5 BACKGROUND OF THE INVENTION 

DNA methylases transfer methyl groups from the universal methyl donor S- 
adenosyl methionine to specific sites on the DNA. Several biological functions have 
been attributed to the methylated bases in DNA. The most established biological 
function for methylated DNA is the protection of DNA from digestion by cognate 

10 restriction enzymes. The restriction modification phenomenon has, so far, been 

observed only in bacteria. Mammalian cells, however, possess a different methylase 
that exclusively methylates cytosine residues that are 5' neighbors of guanine (CpG). 
This modification of cytosine residues has important regulatory effects on gene 
expression, especially when involving CpG rich areas, known as CpG islands, located 

1 5 in the promoter regions of many genes. 

Methylation has been shown by several lines of evidence to play a role in gene 
activity, cell differentiation, tumori genesis, X-chromosome inactivation, genomic 
imprinting and other major biological processes (Razin, A., H., and Riggs, R.D. eds. 
in DNA Methylation Biochemistry and Biological Significance , Springer- Verlag, New 

20 York, 1984). In eukaryotic cells, methylation of cytosine residues that are 

immediately 5' to a guanosine, occurs predominantly in CG poor regions (Bird, A., 
Nature, 321:209, 1986). In contrast, CpG islands remain unmethylated in normal 
cells, except during X-chromosome inactivation (Migeon, et al., supra) and parental 
specific imprinting (Li, et al., Nature, 366:362, 1993) where methylation of 5' 

25 regulatory regions can lead to transcriptional repression. De novo methylation of the 
Rb gene has been demonstrated in a small fraction of retinoblastomas (Sakai, et al., 
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Am. J. Hum. Genet., 48:880, 1991), and recently, a more detailed analysis of the VHL 
gene showed aberrant methylation in a subset of sporadic renal cell carcinomas 
(Herman, et al., Proc. Natl. Acad. Sci., U.S.A., 91:9700, 1994). Expression of a tumor 
suppressor gene can also be abolished by de novo DNA methylation of a normally 
5 unmethylated CpG island (Issa, et al., Nature Genet., 7:536, 1994; Herman, et al., 
supra; Merlo, et al., Nature Med., 1:686, 1995; Herman, et al., Cancer Res., 56:722, 
1996; Graff, et al., Cancer Res., 55:5195, 1995; Herman, et al., Cancer Res., 55:4525, 
1995). 

Human cancer cells typically contain somatically altered nucleic acid, 

10 characterized by mutation, amplification, or deletion of critical genes. In addition, the 
nucleic acid from human cancer cells often displays somatic changes in DNA 
methylation (E.R. Fearon, et al., Cell, 61:759, 1990; P.A. Jones, et al., Cancer Res., 
46:461, 1986; R. Holliday, Science, 238:163, 1987; A. De Bustros, et al., Proc. Natl. 
Acad. Sci., USA, 85:5693, 1988); P.A. Jones, et al., Adv. Cancer Res., 54:1, 1990; 

15 S.B. Baylin, et al., Cancer Cells, 3:383, 1991; M. Makos, et al., Proc. Natl. Acad. Sci., 
USA, 89:1929, 1992; N. Ohtani-Fujita, et al., Oncogene, 8:1063, 1993). However, the 
precise role of abnormal DNA methylation in human tumorigenesis has not been 
established. Aberrant methylation of normally unmethylated CpG islands has been 
described as a frequent event in immortalized and transformed cells, and has been 

20 associated with transcriptional inactivation of defined tumor suppressor genes in 
human cancers. In the development of colorectal cancers (CRC), a series of tumor 
suppressor genes (TSG) such as APC, p53, DCC and DPC4 are inactivated by 
mutations and chromosomal deletions. Some of these alterations result from a 
chromosomal instability phenotype described in a subset of CRC. Recently, an 

25 additional pathway has been shown to be involved in a familial form of CRC, 

hereditary non-polyposis colorectal cancer. The cancers from these patients show a 
characteristic mutator phenotype which causes microsatellite instability (MI), and 
mutations at other gene loci such as TGF-f}-RII (Markowitz et al., Science, 
268(52 15): 1336-8, 1995) and BAX. This phenotype usually results from mutations in 

30 the mismatch repair (MMR) genes hMSH2 and hMLHl. A subset of sporadic CRC 
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also show MI, but mutations in MMR genes appear to be less frequent in these 
tumors. 

Another molecular defect described in CRC is CpG island (CGI) methylation, 
CGIs are short sequences rich in the CpG dinucleotide and can be found in the 5' 

5 region of about half of all human genes. Methylation of cytosine within 5' CGIs is 
associated with loss of gene expression and has been seen in physiological conditions 
such as X chromosome inactivation and genomic imprinting. Aberrant methylation of 
CGIs has been detected in genetic diseases such as the fragile-X syndrome, in aging 
cells and in neoplasia. About half of the tumor suppressor genes which have been 

10 shown to be mutated in the germline of patients with familial cancer syndromes have 
also been shown to be aberrantly methylated in some proportion of sporadic cancers, 
including Rb, VHL, pi 6, hMLHl, and BRCA1. TSG methylation in cancer is usually 
associated with (Antequera, et ah, Proc. Natl. Acad. Sci. USA, 90:1 1995-1 1999, 
1993) lack of gene transcription and (Baylin, et al., Adv. Cancer Res., 72:141-196, 

15 1998) absence of coding region mutation. Thus it has been proposed that CGI 
methylation serves as an alternative mechanism of gene inactivation in cancer. 

The causes and global patterns of CGI methylation in human cancers remain 
poorly defined. Aging could play a factor in this process because methylation of 
several CGIs could be detected in an age-related manner in normal colon mucosa as 

20 well as in CRC. In addition, aberrant methylation of CGIs has been associated with 
the MI phenotype in CRC as well as specific carcinogen exposures. However, an 
understanding of aberrant methylation in CRC has been somewhat limited by the 
small number of CGIs analyzed to date. In fact, previous studies have suggested that 
large numbers of CGIs are methylated in immortalized cell lines, and it is not well 

25 understood whether this global aberrant methylation is caused by the cell culture 
conditions or whether they are an integral part of the pathogenesis of cancer. 



Most of the methods developed to date for detection of methylated cytosine 
depend upon cleavage of the phosphodiester bond alongside cytosine residues, using 
either methylation-sensitive restriction enzymes or reactive chemicals such as 
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hydrazine which differentiate between cytosine and its 5-methyl derivative. Genomic 
sequencing protocols which identify a 5-MeC residue in genomic DNA as a site that is 
not cleaved by any of the Maxam Gilbert sequencing reactions have also been used, 
but still suffer disadvantages such as the requirement for large amount of genomic 
5 DNA and the difficulty in detecting a gap in a sequencing ladder which may contain 
bands of varying intensity. 

Mapping of methylated regions in DNA has relied primarily on Southern 
hybridization approaches, based on the inability of methylation-sensitive restriction 
enzymes to cleave sequences which contain one or more methylated CpG sites. This 
10 method provides an assessment of the overall methylation status of CpG islands, 
including some quantitative analysis, but is relatively insensitive and requires large 
amounts of high molecular weight DNA. 

Another method utilizes bisulfite treatment of DNA to convert all 
unmethylated cytosines to uracil. The altered DNA is amplified and sequenced to 
15 show the methylation status of all CpG sites. However, this method is technically 
difficult, labor intensive and without cloning amplified products, it is less sensitive 
than Southern analysis, requiring approximately 10% of the alleles to be methylated 
for detection. 

Identification of the earliest genetic changes in tumorigenesis is a major focus 
20 in molecular cancer research. Diagnostic approaches based on identification of these 
changes are likely to allow implementation of early detection strategies and novel 
therapeutic approaches targeting these early changes might lead to more effective 
cancer treatment. 

About half of all human genes have 5' CpG islands and these islands are 
25 usually associated with the 5' regulatory regions of genes (Antequera, et al., Proc. 
Natl. Acad. Sci. USA 90:1 1995-1 1999, 1993). The 5' CpG islands of most 
nonimprinted genes are thought to remain unmethylated in normal cells but may 
become methylated during aging or tumorigenesis. Through interactions between 
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methyl CpG binding proteins, histones and histone deacetylase, 5' CpG island 
methylation can contribute to changes in chromatin that cause transcriptional silencing 
(Baylin, et al., Adv. Cancer Res. 72: 141-196, 1998). Promoter methylation is 
implicated in the transcriptional silencing of tumor suppressor and mismatch repair 

5 genes (e.g. pi 6, Rb, VHL, hMLHl) in many cancers. Although 13 hypermethylated 
genes and clones in pancreatic cancers were previously identified (Ueki, et al., Cancer 
Res. 60:1835-1 839, 2000), there almost certainly are others. Costello et al. have 
estimated that -400 genes are aberrantly methylated in cancers and have found 
evidence for tumor-specific pattern of methylation (Costello, et al., Nat. Genet. 

10 24:132-138, 2000). A better description of the pattern of DNA methylation 
abnormalities in cancer may improve an understanding of the role of DNA 
methylation in tumorigenesis and identification of differentially methylated CpG 
islands in cancer may lead to the discovery of novel genes with tumor suppressor 
properties. Finally, identified genes or loci could be utilized as cancer-specific 

15 markers for the early detection of cancer (Belinsky, et al., Proc. Natl. Acad. Sci. USA 
95:11891-11896,1998). 

Pancreatic cancer is the fourth leading cause of cancer death in men and in 
women and each year -28,000 Americans die of the disease (Greenlee, et al., CA 
Cancer J. Clin. 50:7-33, 2000). Frequent genetic changes such as mutational 

20 activation of the K-ras oncogene and inactivation of the pi 6, DPC4, p53, MKK4, 
STK1 1, TGFBR2, and TGFBR1 tumor suppressor genes have been described in 
pancreatic cancer (Goggins, et al., Ann. Oncol. 10:4-8, 1999, Rozenblum, et al., 
Cancer Res. 57:1731-1734, 1997). Although multiple tumor suppressor pathways 
have been shown to play a role in pancreatic carcinogenesis, little is known about the 

25 contribution of DNA methylation to inactivation of genes in these pathways. Recently, 
a novel technique, methylated CpG island amplification (MCA), was developed to 
enrich for methylated CpG rich sequences. MCA coupled with RDA (MCA/RDA) can 
recover CpG islands differentially methylated in cancer cells (Toyota, et al., Cancer 
Res. 59:2307-2312, 1997). 
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SUMMARY OF THE INVENTION 

The present invention is based on the finding that several genes are newly 
identified as being differentially methylated in cancer. This seminal discovery is 
useful for cancer screening, risk-assessment, prognosis, minimal-residual disease 

5 identification, staging and identification of therapeutic targets. The identification of 
new genes that are methylated in cancer, aging or diseases associated with aging 
increases the likelihood of finding genes methylated in a particular cancer; increases 
the sensitivity and specificity of methylation detection; allows methylation profiling 
using multiple genes; and allows identification of new targets for therapeutic 

10 intervention. The invention also provides a newly identified gene that is a target for 
hypermethylation in human tumors. 

In one embodiment, there are provided methods for detecting a cellular 
proliferative disorder in a subject. The subject may have or be at risk of having a 
cellular proliferative disorder. The method of the invention is useful for diagnostic as 

15 well as prognostic analyses. One method for detecting a cellular proliferative disorder 
in a subject includes contacting a nucleic acid-containing specimen from the subject 
with an agent that provides a determination of the methylation state of at least one 
gene or associated regulatory region of the gene; and identifying aberrant methylation 
of regions of the gene or regulatory region, wherein aberrant methylation is identified 

20 as being different when compared to the same regions of the gene or associated 

regulatory region in a subject not having the cellular proliferative, thereby detecting a 
cellular proliferative disorder in the subject. The method includes multiplexing by 
utilizing a combination of primers for more than one loci, thereby providing a 
methylation "profile" for more than one gene or regulatory region. 

25 For the first time, the invention provides methylated forms of genes and/or 

their associated regulatory sequences referred to herein as MICP1-42. MICP39-42 
have no homology to known human sequences. Eleven clones matched human genes 
(MICP1-1 1); 10 clones matched human ESTs (MICP12-21); 5 clones matched human 
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CpG islands (MICP22-26); and 12 clones matched human genome sequences 
(MICP27-38). (see Table 1 as reference). 

Invention methods include determining, in a nucleic acid-containing specimen 
taken from a subject, the methylation state of a gene or regulatory sequences 

5 associated therewith, wherein the expression or non-expression of the gene is 

associated with the presence of the cellular proliferative disorder, and identifying as 
having a cellular proliferative disorder a subject that has aberrant methylation of 
regions of the gene or associated regulatory sequences when compared to the same 
regions of the gene in a subject not having the cellular proliferative disorder. In one 

10 aspect of this embodiment, the methylated regions of the gene and associated 

regulatory sequences are contained within CpG islands (i.e., CpG rich regions). In 
particular, the aberrant methylation typically includes hypermethylation as compared 
with the same regions of the gene or regulatory sequences in a subject not having the 
cellular proliferative disorder. 

Determining the methylation state of the gene includes contacting the nucleic 
acid-containing specimen with an agent that modifies unmethylated cytosine, 
amplifying a CpG-containing nucleic acid in the specimen by means of CpG-specific 
oligonucleotide primers, wherein the oligonucleotide primers distinguish between 
modified methylated and nonmethylated nucleic acid, and detecting the methylated 
nucleic acid based on the presence or absence of amplification products produced in 
said amplifying step. The method includes optionally contacting the amplification 
products with a methylation sensitive restriction endonuclease. Other methods for 
determining methylation status of a gene and/or regulatory sequences are well known 
in the art and are described more fully herein. 

25 In another embodiment of the present invention there is provided a kit useful 

for the detection of a cellular proliferative disorder in a subject having or at risk for 
having a cellular proliferative disorder. Invention kits include a carrier means 
compartmentalized to receive a sample, one or more containers comprising a first 
container containing a reagent which modifies unmethylated cytosine and a second 



15 



20 
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container containing primers for amplification of a CpG-containing nucleic acid, 
wherein the primers distinguish between modified methylated and nonmethylated 
nucleic acid, and optionally, a third container containing a methylation sensitive 
restriction endonuclease. In a preferred embodiment, the cell proliferative disorder is 
5 pancreatic carcinoma. 

To identify CpG islands differentially methylated in pancreatic 
adenocarcinoma, methylated CpG island amplification (MCA) was used, coupled with 
representational difference analysis (MCA/RDA). Clones differentially methylated 
(termed MICP, methylated in carcinoma of the pancreas) were isolated in a panel of 8 
pancreatic cancer cell lines compared to normal pancreas. 95% of these clones were 
CpG islands and among these clones were 5' CpG islands of several known genes, 
including Cyclin G and Preproenkephalin (ppENK, encoding [Met5]-enkephalin). 
Seven of the clones (Cyclin G, ppENK, MICP20, 23, 33, 35 and 36) were not 
methylated in 14 normal pancreata by MSP while 15 primary pancreatic 
adenocarcinomas were methylated in 7%, 87%, 13%, 53%, 33%, 40% and 0% of 
cases, respectively. Two of the 5 chronic pancreatitis specimens harbored methylation 
of three (Cyclin G, ppENK and MICP23) and one (ppENK) clones, respectively. 
There was no identification of methylation of MICP33 and 35 in any normal 
gastrointestinal tissues tested. Three clones (ppENK, MICP20 and 23) were variably 
methylated in normal gastric, duodenal and colonic mucosae. Aberrant methylation of 
Cyclin G and ppENK in methylated pancreatic cancer cell lines was associated with 
transcriptional silencing that was reversible with 5-aza-2'-deoxycytidine treatment. 

These data indicate that multiple CpG islands undergo de novo methylation 
during pancreatic carcinogenesis. Methylation of some CpG islands is cancer-specific 
25 while others show tissue-specific patterns of methylation. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 shows the representative results of MCDPs isolated by MCA/RDA. 
Dot blot analysis. 1 A, An example of kinetic enrichment of methylated sequences by 



15 
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RDA. MCA products from the driver and the tester (PL8) and the PCR products from 
first (1st SA), second (2nd SA) and third (3rd SA) competitive hybridization/selective 
amplification were blotted onto the membrane and hybridized with a labeled Cyclin G 
probe. IB, Dot blot analysis using MICPs isolated from MCA/RDA as probes. First 
5 three MICPs were hybridized only to the tester (either PL3 or PL8), whereas the next 
three MICPs were weakly hybridized to the driver as well as the tester. 

Figure 2 shows a CpG plot across the 5' CpG islands of Cyclin G and ppENK 
showing the relation between isolated clones and their corresponding genes. The 
positions of MSP primers are also indicated. The black boxes represent exons. 

10 Figure 3 shows a summary of the average level of methylation of selected 

MCA/RDA MICPs by bisulfite-modified genomic sequencing. The specimen number 
and the age of the patients are in the left column. Empty oval, 0-10% methylation; 
white oval with black dots, 1 1-30% methylation; black oval with white dots, 31-70% 
methylation; black oval, 71-100% methylation; not determined. NP, normal 

15 pancrease; PCa, primary pancreatic adenocarcinoma. 

Figure 4 shows MSP analyses of MICP36 (A) and MICP25 (B) in primary 
pancreatic adenocarcinomas and normal tissues. The PCR products in the lanes U and 
M indicate the presence of unmethylated and methylated templates, respectively. 
PCa, primary pancreatic carcinoma, NP, normal pancreas, NC, normal colonic 

20 mucosa. Expression of Cyclin G (C) and ppENK (D) in pancreatic cancer cell lines 
and normal pancreata by RT-PCR analysis. Cyclin G and ppENK were coamplified 
with GAPDH to ensure the RNA integrity. Cyclin G was expressed at low level in the 
methylated cell line PL8 compared with unmethylated cell lines (PL3, CAPAN2, 
MiaPaca2, and Panel) and four normal pancreata. 5 Aza-dC treatment increased 

25 Cyclin G expression in PL8. All methylated cell lines examined (PL3, CAPAN2, 
MiaPaca2 and Panel) lacked expression of ppENK, whereas all three normal 
pancreata expressed ppENK. Treatment of four cell lines with 5 Aza-dC restored the 
ppENK expression. The expected sizes of the PCR products are 306 bp for GAPDH, 
207 bp for Cyclin G, and 179 bp for ppENK. 
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Figure 5 shows aberrant methylation of MICPs and clinicopathological 
variables. Multivariate linear regression analysis was used to determine the 
relationship between the number of aberrantly methylated MICPs in a pancreatic 
adenocarcinoma and the variables patient age (left plot) and tumor diameter (right 
5 plot). For each decade increase in age, the average number of methylated loci 
increases by 0.28, regardless of the tumor diameter. For each increase in tumor 
diameter (in cm), the number of mehtylated loci increases by 0.156 regardless of 
patient age. 

Figure 6a-6i shows the nucleic acid sequence for SEQ ID NO: 1-42. 

10 DETAILED DESCRIPTION OF THE INVENTION 

It has been determined that an aberrant methylation state of nucleic acids in 
certain genes, particularly regulatory sequences, is diagnostic for the presence or 
potential development of a cellular proliferative disorder in subjects bearing the 
aberrantly methylated nucleic acids. More particularly, the hypermethylation of 

15 certain nucleotides localized in CpG islands has been shown to affect the expression 
of genes associated with the CpG islands; typically such hypermethylated genes have 
reduced or abolished expression, primarily due to down-regulated transcription. 
Using a recently developed PCR-based technique called methylated CpG island 
amplification (MCA), several nucleic acid molecules aberrantly methylated in 

20 pancreatic cancer cells line were identified. 

Of the 42 unique clones isolated using MCA/RDA, seven (Cyclin G, ppENK, 
MICP20, 23, 33, 35 and 36) were CpG islands aberrantly methylated in pancreatic 
carcinoma compared to normal pancreas. Indeed, none of these 7 clones, were 
methylated in a panel of normal pancreata using the highly sensitive method of MSP. 
25 Two of these seven clones corresponded to the 5' CpG island of the genes ppENK and 
Cyclin G. ppENK encodes opioid growth factor, also known as [Met5]-enkephalin. 
This opioid peptide induces apoptosis in lung cancer cell lines (Maneckjee & Minna, 
Cell Growth Differ. 5:1033-1040, 1994) and several studies have demonstrated that 
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this peptide has a negative growth regulatory effect on various kinds of cancers, 
including pancreatic cancer (Zagon, et al., Int. J. Oncol. 14:577-584, 1999). Aberrant 
methylation of the 5' CpG island of ppENK was found in 87% of primary pancreatic 
carcinomas. Hypermethylation was associated with transcriptional silencing of this 
5 gene in pancreatic cancer cell lines. Although a low level of methylation in other 
normal mucosae by MSP was also observed, these results indicate that de novo 
methylation of 5' CpG islands and transcriptional repression of ppENK might 
contribute to pancreatic carcinogenesis. 

Cyclin G is a target for transcriptional activation by p53 (Okamoto, et al., 
10 Embo. J. 13:48616-4822, 1994). Conflicting functions for Cyclin G have been 

reported. When overexpressed Cyclin G is associated with cell growth in vitro (Smith, 
et al., Exp. Cell. Res. 230:61-68, 1997), while increased expression of Cyclin G also 
augments apoptosis of multiple cancer cell lines in response to different stimuli 
(Okamoto & Prives, Oncogene 18:4606-4615, 1999). The data demonstrate the 
15 association between hypermethylation of the 5' CpG island of Cyclin G and its 

transcriptional repression in the pancreatic cancer cell line PL8 (Fig. 4C), suggesting 
that Cyclin G could have been selected for silencing because of tumor suppressive 
functions. These data extend previous findings demonstrating aberrant methylation of 
multiple cancer-related genes including pi 6 and hMLHl in pancreatic cancer (Ueki, et 
20 al., Cancer Res. 60:1835-1839, 2000). 

Methylation of several clones in pancreata with chronic pancreatitis was also 
observed. Two of the 5 pancreata with chronic pancreatitis harbored aberrant 
methylation and one of these two pancreata contained a PanIN lesion and this latter 
sample displayed methylation of 3 clones. Previous studies demonstrated that chronic 

25 pancreatitis is a significant risk factor of development of pancreatic cancer 

(Lowenfels, et al., N. Engl. J. Med. 328:1433-1437, 1993) and duct lesions (PanIN) 
often found in chronic pancreatitis are considered as the precursors of infiltrating 
pancreatic carcinoma (Hruban, et al., Clin. Cancer Res. 6:2969-2972, 2000). The 
presence of aberrant methylation in DNA from chronic pancreatitis suggests that de 

30 novo methylation of CpG islands may be an early event in pancreatic cancer 



.02066694A1 J_> 



WO 02/068694 



PCT/US02/05681 



12 

development in this setting. Additional studies are needed to determine the role and 
the timing of de novo methylation of CpG islands in progression model of pancreatic 
cancer (Hruban, et al., Clin. Cancer Res. 6:2969-2972, 2000). The absence of 
methylated templates of M1CP33 and 35 in any normal tissue examined raises the 
5 possibility that MSP could be used to detect aberrant methylation of these clones in 
clinical samples such as stool, blood, or pancreatic fluid, for the early detection of 
pancreatic cancers. 

Three additional known genes containing methylated 5'CpG islands in 
pancreatic cancers were identified (GAD1, ECEL1 and PAX5). These genes were 

10 isolated by MCA/RDA because relatively fewer DNA templates were methylated in 
normal pancreas. Some genes that are methylated in cancer and in only a small 
percentage of normal cells may appear to undergo selection during carcinogenesis 
(Salem, et al., Cancer Res. 60:2473-2476, 2000), but are merely unselected epigenetic 
markers of stem cells that have evolved to cancers by other clonal selection events. 

15 This phenomenon is important to be aware of when studying methylation in human 

cancer and makes it much more difficult to assign causality to methylation phenomena 
in cancers compared to genetic events such as homozygous deletion. For normally 
unmethylated genes whose function is well characterized such as hMLHl, or for genes 
that are methylated as a second hit for a tumor suppressor gene (e.g. VHL, E- 

20 cadherin), or for tumor suppressor genes alternatively targeted by genetic and 

epigenetic inactivation (e.g. pl6 and RB) (Baylin, et al., Adv. Cancer Res. 72:141- 
196, 1998), the biological significance of "aberrant methylation" is well accepted. As 
additional genes are identified that are methylated in pancreatic cancer, it will be 
important to rule out low-level methylation using sensitive techniques such as MSP 

25 (Herman, et al., Proc. Natl. Acad. Sci. USA 93:9821-9826, 1996) before such genes 
are accepted as having undergone selection through methylation. 

Three CpG islands that were differentially methylated in pancreatic cancer but 
were not located within 5' regions of their corresponding genes were also identified, 
CSX, MCT3 and ICAM5. The role of hypermethylation of non-5* CpG islands need 
30 to be defined, but recent studies have shown that aberrant methylation in cancer is not 
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confined to the 5' region but can occur in internal exons and 3' regions of genes 
(Costello, et al., Nat. Genet. 24:132-138, 2000, Liang, et al., Genomics 53:260-268, 
1998). It is probable that several of the unknown clones identified are genes whose 
expression is repressed by aberrant methylation. For example, GRAJL and 
5 GENSCAN programs suggest putative coding exons downstream of MICP23 and 
MICP33. 

In addition to identifying CpG islands with cancer-related methylation, CpG 
islands that were methylated in both normal pancreata and neoplastic tissues were 
isolated. Methylation of non-neoplastic colorectal (Ahuja, et al., Cancer Res. 

10 58:5489-5494, 1998, Toyota, et al., Proc. Natl. Acad. Sci. USA 96:8681-8686, 1999), 
bladder and prostate tissues (Liang, et al., Genomics 53:260-268, 1998) has been 
observed for several genes and CpG islands. Methylation in normal tissues that is not 
the result of imprinting is frequently "age-related." This has been best shown in the 
colonic mucosa for genes such as ER (Ahuja, et al., Cancer Res. 58:5489-5494, 1998, 

15 Toyota, et al., Proc. Natl. Acad. Sci. USA 96:8681-8686, 1999). However, normal 
pancreata used in this study were obtained from 14 patients (mean age of 62) and for 
at least 5 of these clones there was no evidence of age-related methylation (Fig. 3). 
This suggests that the aberrant methylation observed in the pancreatic cancers in this 
study is not simply a function of age. Some of the clones listed in Table 1 could 

20 undergo age-related methylation, but to demonstrate this would require many more 

normal pancreata. Methylation of ppENK, MICP20 and 23 in a significant percentage 
of the DNA templates within normal gastric, duodenal and colonic mucosae highlights 
the tissue specific nature of DNA methylation. This low level methylation of different 
normal tissues might also explain some of the tumor-specific methylation patterns 

25 observed by others (Costello, et al., Nat. Genet. 24: 1 32-138, 2000). The data indicate 
that there are tissue-specific methylation patterns in normal tissues (Liang, et al., 
Genomics 53:260-268, 1998) and age-related methylation changes are probably 
restricted to certain genes and/or tissues (Ahuja, et al., Cancer Res. 58:5489-5494, 
1998). 
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In this study, the MCA/RDA technique was modified and these modifications 
may have improved the efficiency of the MCA/RDA technique. Betaine was included 
in the PCR reaction and amplified the methylated templates under a higher annealing 
temperature (77°C). The combination of betaine and DMSO can uniformly amplify a 

5 mixture of DNA with different GC content (Baskaran, et al., Genome Res. 6:633-638, 
1996). These modifications might have enhanced the amplification of distinct clones 
instead of Alu repetitive sequences that accounted for 60% of the recovered clones 
using the original protocol (Toyota, et al., Cancer Res. 59:2307-2312, 1997). The 
subtractive and kinetic enrichment of differentially methylated sequence by RDA as 

10 shown in this study (Fig. 1 A) may have advantages over other techniques to isolate 

differentially methylated sequences between normal tissue and cancer (Costello, et al., 
Nat. Genet. 24:132-138, 2000, Liang, et al., Genomics 53:260-268, 1998). 
MCA/RDA, however, has limitations for identifying differentially methylated 
sequences. First, MCA only detects differentially methylated sequences with two 

15 restriction enzyme sites (Smal in this study). Second, MCA/RDA not only identifies 
absolute differences in methylation between the tester and the driver, it also will 
identify methylated sequences both in the tester and the driver if for example, there is 
low level methylation in the driver (normal pancreas). Third, some clones that 
appeared to be unmethylated by dot-blot hybridization were indeed methylated by 

20 bisulfite sequencing in normal pancreata (MICP15, Fig. IB and Fig. 4C). Finally, 

even if bisulfite sequencing suggests that there is no methylation of a clone in normal 
pancreata, it is important to rule out low-level methylation in normal tissues using a 
sensitive technique, such as MSP. 

The results indicate that aberrant methylation of CpG islands is a common 
25 event in pancreatic carcinogenesis. Some de novo methylated CpG islands could 
serve as cancer-specific markers while others reflect tissue specific pattern of DNA 
methylation. 

Methylated nucleic acid sequences are also provided. For the first time, the 
present invention provides methylated chemical structures for MICP 1-42 (see Table 
30 1). One of skill in the art can now readily locate the CpG-rich sequences associated 



BNSDOCID: <WO 02O68694A1_l_> 



WO 02/068694 



PCT/US02/05681 



15 

with these genes and identify such methylated forms of the genes/regulatory sequences 
by methods described herein (The gene sequences can be identified in a gene database 
found at http://www.ncbi.nim.nih.gov/UniGene/index.html). The invention provides 
CpG-rich regions from the above genes as set forth in SEQ ED Nos 1-42, equivalent to 
5 M1CP 1-42, respectively. 

The term "polynucleotide'^ "nucleic acid sequence" refers to a polymeric 
form of nucleotides at least 10 bases in length. An "isolated polynucleotide" is a 
polynucleotide that is not immediately contiguous with both of the coding sequences 
with which it is immediately contiguous (one on the 5' end and one on the 3' end) in 

10 the naturally occurring genome of the organism from which it is derived. Thus, an 
isolated polynucleotide may include a coding region with its associated regulatory 
sequences. The term therefore includes, for example, a recombinant DNA which is 
incorporated into a vector; into an autonomously replicating plasmid or virus; or into 
the genomic DNA of a prokaryote or eukaryote, or which exists as a separate molecule 

15 (e.g., a cDNA) independent of other sequences. The nucleotides of the invention can 
be ribonucleotides, deoxyribonucleotides, or modified forms of either nucleotide. 
Specifically, methylated forms of nucleotides in a polynucleotide sequence are also 
included. The term includes single and double forms of DNA. 

As will be understood by those of skill in the art, when the sequence is RNA, 
20 the deoxynucleotides A, G, C, and T of SEQ ID NO: 1-42, are replaced by 

ribonucleotides A, G, C, and U, respectively. Also included in the invention are 
fragments of the above-described nucleic acid sequences that are at least 15 bases in 
length, which is sufficient to permit the fragment to selectively hybridize to DNA that 
encodes polypeptides encoded by SEQ ED NO: 1-42. The term "selectively hybridize" 
25 refers to hybridization under moderately or highly stringent conditions (See, 
Sambrook, as cited herein) which excludes non-related nucleotide sequences. 

The nucleic acid sequence includes the disclosed sequence and sequences that 
encode conservative variations of the polypeptides encoded by polynucleotides 
provided herein. The term "conservative variation" as used herein denotes the 
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replacement of an amino acid residue by another, biologically similar residue. 
Examples of conservative variations include the substitution of one hydrophobic 
residue such as isoleucine, valine, leucine or methionine for another, or the 
substitution of one polar residue for another, such as the substitution of arginine for 
5 lysine, glutamic for aspartic acid, or glutamine for asparagine, and the like. The term 
"conservative variation" also includes the use of a substituted amino acid in place of 
an unsubstituted parent amino acid provided that antibodies raised to the substituted 
polypeptide also immunoreact with the unsubstituted polypeptide. 

Nucleic acid sequences of the invention can be expressed in vitro by DNA 
10 transfer into a suitable host cell. "Host cells" are cells in which a vector can be 

propagated and its DNA expressed. The cell may be prokaryotic or eukaryotic. The 
term also includes any progeny of the subject host cell. It is understood that all 
progeny may not be identical to the parental cell since there may be mutations that 
occur during replication. However, such progeny are included when the term "host 
15 cells" is used. Methods of stable transfer, meaning that the foreign DNA is 
continuously maintained in the host, are known in the art. 

In one aspect, the nucleic acid sequences may be inserted into an expression 
vector. The term "expression vector" refers to a plasmid, virus or other vehicle 
known in the art that has been manipulated by insertion or incorporation of the 

20 sequence of interest genetic sequences. Polynucleotide sequence which encode 
sequence of interest can be operatively linked to expression control sequences. 
"Operatively linked" refers to a juxtaposition wherein the components so described 
are in a relationship permitting them to function in their intended manner. An 
expression control sequence operatively linked to a coding sequence is ligated such 

25 that expression of the coding sequence is achieved under conditions compatible with 
the regulatory or expression control sequences. As used herein, the terms "regulatory 
sequences" and "expression control sequences" refers to nucleic acid sequences that 
regulate the expression of a nucleic acid sequence to which it is operatively linked. 
Expression control sequences are operatively linked to a nucleic acid sequence when 

30 the expression control sequences control and regulate the transcription and, as 
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appropriate, translation of the nucleic acid sequence. Thus expression control 
sequences can include appropriate promoters, enhancers, transcription terminators, a 
start codon (i.e., ATG) in front of a protein-encoding gene, splicing signal for introns, 
maintenance of the correct reading frame of that gene to permit proper translation of 
5 mRNA, and stop codons. The terms "regulatory sequences" and "expression control 
sequences" are intended to included, at a minimum, components whose presence can 
influence expression, and can also include additional components whose presence is 
advantageous, for example, leader sequences and fusion partner sequences. An 
example of an expression control sequence includes a promoter. 

10 A "promoter" is a minimal sequence sufficient to direct transcription. Also 

included in the invention are those promoter elements which are sufficient to render 
promoter-dependent gene expression controllable for cell-type specific, tissue- 
specific, or inducible by external signals or agents; such elements may be located in 
the 5' or 3' regions of the gene. Both constitutive and inducible promoters, are 

15 included in the invention (see, e.g., Bitter et al., Methods in Enzymology 153 :5 16- 
544, 1987). For example, when cloning in bacterial systems, inducible promoters 
such as pL of bacteriophage plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like 
may be used. When cloning in mammalian cell systems, promoters derived from the 
genome of mammalian cells (e.g., metallothionein promoter) or from mammalian 

20 viruses (e.g., the retrovirus long terminal repeat; the adenovirus late promoter; the 
vaccinia virus 7.5K promoter) may be used. Promoters produced by recombinant 
DNA or synthetic techniques may also be used to provide for transcription of the 
nucleic acid sequences of the invention. 

In the present invention, the polynucleotide sequences may be inserted into an 
25 expression vector which contains a promoter sequence which facilitates the efficient 
transcription of the inserted genetic sequence of the host. The expression vector 
typically contains an origin of replication, a promoter, as well as specific genes which 
allow phenotypic selection of the transformed cells. Vectors suitable for use in the 
present invention include, but are not limited to the T7-based expression vector for 
30 expression in bacteria (Rosenberg et al., Gene 56:125, 1987), the pMSXND 
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expression vector for expression in mammalian cells (Lee and Nathans, J. Biol. Chem. 
263 :3521, 1988) and baculovirus-derived vectors for expression in insect cells. The 
DNA segment can be present in the vector operably linked to regulatory elements, for 
example, a promoter (e.g., T7, metallothionein I, or polyhedron promoters). 

5 Polynucleotide sequences of the invention can be expressed in either 

prokaryotes or eukaryotes. Hosts can include microbial, yeast, insect and mammalian 
organisms. Methods of expressing DNA sequences having eukaryotic or viral 
sequences in prokaryotes are well known in the art. Biologically functional viral and 
plasmid DNA vectors capable of expression and replication in a host are known in the 

10 art. Such vectors are used to incorporate DNA sequences of the invention. 

'Transformation" means a genetic change induced in a cell following 
incorporation of new DNA (i.e., DNA exogenous to the cell). Where the cell is a 
mammalian cell, the genetic change is generally achieved by introduction of the DNA 
into the genome of the cell (i.e., stable). 

15 Thus, a "transformed cell" is a cell into which (or into an ancestor of which) 

has been introduced, by means of recombinant DNA techniques, a DNA molecule 
encoding sequence of interest. Transformation of a host cell with recombinant DNA 
may be carried out by conventional techniques as are well known to those skilled in 
the art. Where the host is prokaryotic, such as E. coli, competent cells which are 

20 capable of DNA uptake can be prepared from cells harvested after exponential growth 
phase and subsequently treated by the CaC12 method using procedures well known in 
the art. Alternatively, MgC12 or RbCl can be used. Transformation can also be 
performed after forming a protoplast of the host cell if desired. 

When the host is a eukaryote, such methods of transfection of DNA as calcium 
25 phosphate co-precipitates, conventional mechanical procedures such as 

microinjection, electroporation, insertion of a plasmid encased in liposomes, or virus 
vectors may be used. Eukaryotic cells can also be cotransformed with DNA 
sequences encoding the sequence of interest, and a second foreign DNA molecule 
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encoding a selectable phenotype, such as the herpes simplex thymidine kinase gene. 
Another method is to use a eukaryotic viral vector, such as simian virus 40 (SV40) or 
bovine papilloma virus, to transiently infect or transform eukaryotic cells and express 
the protein (see for example, Eukaryotic Viral Vectors , Cold Spring Harbor 
5 Laboratory, Gluzman ed., 1982). 

Isolation and purification of microbial expressed polypeptide, or fragments 
thereof, provided by the invention, may be carried out by conventional means 
including preparative chromatography and immunological separations involving 
monoclonal or polyclonal antibodies. 

10 In one embodiment, the invention provides substantially purified polypeptide 

encoded by polynucleotide sequences SEQ ID NO: 1-42. The term "substantially 
purified" as used herein refers to a polypeptide which is substantially free of other 
proteins, lipids, carbohydrates or other materials with which it is naturally associated. 
One skilled in the art can purify a polypeptide sequence using standard techniques for 

15 protein purification. The substantially pure polypeptide will yield a single major band 
on a non-reducing polyacrylamide gel. The purity of the polypeptide can also be 
determined by amino-terminal amino acid sequence analysis. 

Minor modifications of the primary amino acid sequences may result in 
proteins which have substantially equivalent activity as compared to the unmodified 
20 counterpart polypeptide described herein. Such modifications may be deliberate, as 
by site-directed mutagenesis, or may be spontaneous. All of the polypeptides 
produced by these modifications are included herein as long as the biological activity 
still exists. 

The polypeptides of the invention also include dominant negative forms of the 
25 invention polypeptide which do not have the biological activity of invention 

polypeptide sequence. A "dominant negative form" of invention is a polypeptide that 
is structurally similar to invention polypeptide but does not have wild-type invention 
function. For example, a dominant-negative invention polypeptide may interfere with 
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wild-type invention function by binding to, or otherwise sequestering, regulating 
agents, such as upstream or downstream components, that normally interact 
functionally with the invention polypeptide. 

To identify genes differentially methylated in colorectal cancer, methylated 
CpG island amplification was used followed by representational difference analysis 
(Razin and Cedar, Cell 17: 473-476, 1994, herein incorporated by reference). One of 
the clones recovered (MINT3 1 , see U.S. Patent Application Serial No. 09/309,175, 
incorporated by reference herein in its entirety) mapped to human chromosome 17q21 
using a radiation hybrid panel. A Blast search revealed this fragment to be completely 
identical to part of a BAC clone (Genbank: AC004590) sequenced by high throughput 
genomic sequencing. The region surrounding MINT31 fulfills the criteria of a CpG 
island: GC content 0.67, CpG/GpC ratio 0.78 and a total of 305 CpG sites in a 4 kb 
region. Using this CpG island and 10 kb of flanking sequences in a Blast analysis, 
several regions highly homologous to the rat T-type calcium channel gene, 
CACNA1G, were identified (Perez-Reyes et al., Nature 391_: 896-900. 1998, herein 
incorporated by reference). Several ESTs were also identified in this region. Using 
Genscan, 2 putative coding sequences (Gl, and G2) were identified. Blastp analysis 
revealed that Gl has a high homology to the EH-domain-binding protein, epsin, while 
G2 is homologous to a C. elegans hypothetical protein (accession No. 2496828). 

20 Due to the clear correlation between methylation of CpG islands and cellular 

proliferative disorders, in another embodiment of the present invention, there are 
provided methods for detecting a cellular proliferative disorder in a subject having or 
at risk for said cellular proliferative disorder. The method includes assaying, in 
nucleic acid-containing specimen taken from said subject, the methylation state of a 

25 gene or its associated regulatory regions, wherein the expression state of the gene or 
its associated regulatory regions is associated with the presence of the cellular 
proliferative disorder, and identifying as having a cellular proliferative disorder a 
subject that has aberrant methylation of regions of said gene. The method provides for 
detecting a cellular proliferative disorder in a subject having or at risk for said cellular 

30 proliferative disorder by identifying aberrantly methylation of regions of a gene when 



10 



BNSDOCID: <WO 02068694A 1 _l_> 



WO 02/068694 



PCT/US02/05681 



21 

compared to the same regions of the gene in a subject not having said cellular 
proliferative disorder. 

The aberrant methylation comprises hypermethylated CpG rich regions (i.e., 
islands). In one aspect of the present invention, the CpG rich regions are associated 
5 with the invention genes gene, and affect the expression thereof in a methylation state- 
dependent manner. 

A "cell proliferative disorder'* or "cellular proliferative disorder" is any 
disorder in which the proliferative capabilities of the affected cells is different from 
the normal proliferative capabilities of unaffected cells. An example of a cell 

10 proliferative disorder is neoplasia. Malignant cells (i.e., cancer) develop as a result of 
a multistep process. Specific, non-limiting examples of cell proliferative disorders 
associated with increased methylation of CpG-islands are low grade astrocytoma, 
anaplastic astrocytoma, glioblastoma, medulloblastoma, gastric cancer, colorectal 
cancer, colorectal adenoma, acute myelogenous leukemia, lung cancer, renal cancer, 

15 pancreatic cancer, leukemia, breast cancer, prostate cancer, endometrial cancer and 
neuroblastoma. 

A cell proliferative disorder as described herein may be a neoplasm. Such 
neoplasms are either benign or malignant. The term "neoplasm" refers to a new, 
abnormal growth of cells or a growth of abnormal cells that reproduce faster than 

20 normal. A neoplasm creates an unstructured mass (a tumor) which can be either 
benign or malignant. For example, the neoplasm may be a head, neck, lung, 
esophageal, stomach, small bowel, colon, bladder, kidney, or cervical neoplasm. The 
term "benign" refers to a tumor that is noncancerous, e.g. its cells do not proliferate or 
invade surrounding tissues. The term "malignant" refers to a tumor that is 

25 metastastic or no longer under normal cellular growth control. 

A cell proliferative disorder may be an age-associated disorder. Examples of 
age-associated disorders which are cell proliferative disorders include colon cancer, 
lung cancer, breast cancer, prostate cancer, and melanoma, amongst others. 
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A "nucleic acid containing specimen" includes any type of material containing 
a nucleic acid to be subject to invention methods. The nucleic acid may be contained 
in a biological sample. Such samples include but are not limited to any bodily fluid, 
such as a serum, urine, saliva, blood, cerebrospinal fluid, pleural fluid, ascites fluid, 
5 sputum, stool, or a biopsy sample. 

Samples or specimens include any CpG-rich DNA sequence, whatever the 
origin, as long as the sequence is detectably present in a sample. While routine 
diagnostic tests may not be able to identify cancer cells in these samples, the method 
of the present invention identifies neoplastic cells derived from the primary tumor or 

10 from a metastases. The method includes non-invasive sampling (e.g., bodily fluid) as 
well as invasive sampling (e.g., biopsy). The sample of DNA of the subject may be 
serum, plasma, lymphocytes, urine, sputum, bile, stool, cervical tissue, saliva, tears, 
pancreatic juice, duodenal juice, cerebral spinal fluid, regional lymph node, 
histopathologic margins, and any bodily fluid that drains a body cavity or organ. 

15 Therefore, the method provides for the non-invasive detection of various tumor types 
including head and neck cancer, lung cancer, esophageal cancer, stomach cancer, 
small bowel cancer, colon cancer, bladder cancer, kidney cancers, cervical cancer and 
any other organ type that has a draining fluid accessible to analysis. For example, 
neoplasia of regional lymph nodes associated with a primary mammary tumor can be 

20 detected using the method of the invention. Regional lymph nodes for head and neck 
carcinomas include cervical lymph nodes, prelaryngeal lymph nodes, pulmonary 
juxta-esophageal lymph nodes and submandibular lymph nodes. Regional lymph 
nodes for mammary tissue carcinomas include the axillary and intercostal nodes. 
Samples also include urine DNA for bladder cancer or plasma or saliva DNA for head 

25 and neck cancer patients. 

Any nucleic acid sample, in purified or nonpurified form, can be utilized as the 
starting nucleic acid or acids in accordance with the present invention, provided it 
contains, or is suspected of containing, a nucleic acid sequence containing a target 
locus (e.g., CpG-containing nucleic acid). In general, the CpG-containing nucleic acid 
30 is DNA. However, invention methods may employ, for example, samples that contain 
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DNA, or DNA and RNA, including messenger RNA, wherein DNA or RNA may be 
single stranded or double stranded, or a DNA-RNA hybrid may be included in the 
sample. A mixture of nucleic acids may also be employed. The specific nucleic acid 
sequence to be detected may be a fraction of a larger molecule or can be present 
5 initially as a discrete molecule, so that the specific sequence constitutes the entire 

nucleic acid. It is not necessary that the sequence to be studied be present initially in a 
pure form; the nucleic acid may be a minor fraction of a complex mixture, such as 
contained in whole human DNA. The nucleic acid-containing sample used for 
detection of methylated CpG may be from any source including, but not limited to, 
10 brain, colon, urogenital, lung, renal, pancreas, liver, esophagus, stomach, 

hematopoietic, breast, thymus, testis, ovarian, and uterine tissue, and may be extracted 
by a variety of techniques such as that described by Maniatis, et al. ( Molecular 
Cloning: a Laboratory Manual , Cold Spring Harbor, NY, pp 280, 281, 1982). 

The nucleic acid of interest can be any nucleic acid where it is desirable to 
15 detect the presence of a differentially methylated CpG island. The CpG island 
comprises a CpG island located in a gene or regulatory region for a gene. A "CpG 
island" is a CpG rich region of a nucleic acid sequence. The nucleic acid sequence 
may include, for example, MICP 1-42. However, any gene or nucleic acid sequence 
of interest containing a CpG sequence can provide diagnostic information (i.e., via its 
20 methylation state) using invention methods. 

Moreover, these markers can also be multiplexed in a single amplification 
reaction to generate a low cost, reliable cancer screening test for many cancers 
simultaneously. 

A combination of DNA markers for CpG-rich regions of nucleic acid may be 
25 amplified in a single amplification reaction. The markers are multiplexed in a single 
amplification reaction, for example, by combining primers for more than one locus. 
For example, DNA from a urine sample can be amplified with three different 
randomly labeled primer sets, such as those used for the amplification of the MICP38- 
42 loci, in the same amplification reaction. The reaction products are separated on a 
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denaturing polyacrylamide gel, for example, and then exposed to film for visualization 
and analysis. By analyzing a panel of markers, there is a greater probability of 
producing a more useful methylation profile for a subject. 

If the sample is impure (e.g., plasma, serum, stool, ejaculate, sputum, saliva, 
5 cerebrospinal fluid, or blood or a sample embedded in paraffin), it may be treated 
before amplification with a reagent effective for lysing the cells contained in the 
fluids, tissues, or animal cell membranes of the sample, and for exposing the nucleic 
acid(s) contained therein. Methods for purifying or partially purifying nucleic acid 
from a sample are well known in the art (e.g.,Sambrook et al., Molecular Cloning: a 
10 Laboratory Manual , Cold Spring Harbor Press, 1989, herein incorporated by 
reference). 

In order to detect a differential methylation state for a gene or CpG-containing 
region of interest, invention methods include any means known in the art for detecting 
such differential methylation. For example, detecting the differential methylation may 
include contacting the nucleic acid-containing specimen with an agent that modifies 
unmethylated cytosine, amplifying a CpG-containing nucleic acid in the specimen by 
means of CpG-specific oligonucleotide primers, wherein the oligonucleotide primers 
distinguish between modified methylated and nonmethylated nucleic acid, and 
detecting the methylated nucleic acid based on the presence or absence of 
amplification products produced in said amplifying step. This embodiment includes 
the PCR-based methods described in U.S. Patent No. 5,786,146, incorporated herein 
in its entirety. 

For the first time, the methylation state of a number of genes has been 
correlated with cell proliferative disorders. Examples of such genesand their NCBI 
25 accession numbers, including the location of the clone, are set out in Table 1. 

In another embodiment, detection of differential methylation is accomplished 
by contacting a nucleic acid sample suspected of comprising a CpG-containing 
nucleic acid with a methylation sensitive restriction endonuclease that cleaves only 
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unmethylated CpG sites under conditions and for a time to allow cleavage of 
unmethylated nucleic acid. The sample is further contacted with an isoschizomer of 
the methylation sensitive restriction endonuclease, that cleaves both methylated and 
unmethylated CpG-sites, under conditions and for a time to allow cleavage of 

5 methylated nucleic acid. Oligonucleotides are added to the nucleic acid sample under 
conditions and for a time to allow ligation of the oligonucleotides to nucleic acid 
cleaved by the restriction endonuclease, and the digested nucleic acid is amplified by 
conventional methods such as PCR wherein primers complementary to the 
oligonucleotides are employed. Following identification, the methylated CpG- 

10 containing nucleic acid can be cloned, using method well known to one of skill in the 
art (see Sambrook et ah, Molecular Cloning: a Laboratory Manual , Cold Spring 
Harbor Press, 1989). 

As used herein, a "methylation sensitive restriction endonuclease" is a 
restriction endonuclease that includes CG as part of its recognition site and has altered 

15 activity when the C is methylated as compared to when the C is not methylated. 

Preferably, the methylation sensitive restriction endonuclease has inhibited activity 
when the C is methylated (e.g., Smal). Specific non-limiting examples of a 
methylation sensitive restriction endonucleases include Sma I, BssHII, or Hpall. Such 
enzymes can be used alone or in combination. Other methylation sensitive restriction 

20 endonucleases will be known to those of skill in the art and include, but are not 

limited to SacII, EagI, and BstUI, for example. An "isoschizomer" of a methylation 
sensitive restriction endonuclease is a restriction endonuclease which recognizes the 
same recognition site as a methylation sensitive restriction endonuclease but which 
cleaves both methylated and unmethylated CGs. One of skill in the art can readily 

25 determine appropriate conditions for a restriction endonuclease to cleave a nucleic 
acid (see Sambrook et ah, Molecular Cloning: a Laboratory Manual , Cold Spring 
Harbor Press, 1989). Without being bound by theory, actively transcribed genes 
generally contain fewer methylated CGs than in other genes. 

In one embodiment of the invention, a nucleic acid of interest is cleaved with a 
30 methylation sensitive endonuclease. In one aspect, cleavage with the methylation 
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sensitive endonuclease creates a sufficient overhang on the nucleic acid of interest. 
Following cleavage with the isoschizomer, the cleavage product can still have a 
sufficient overhang. An "overhang" refers to nucleic acid having two strands wherein 
the strands end in such a manner that a few bases of one strand are not base paired to 

5 the other strand. A "sufficient overhang" refers to an overhang of sufficient length to 
allow specific hybridization of an oligonucleotide of interest. In one embodiment, a 
sufficient overhang is at least two bases in length. In another embodiment, the 
sufficient overhang is four or more bases in length. An overhang of a specific 
sequence on the nucleic acid of interest may be desired in order for an oligonucleotide 

10 of interest to hybridize. In this case, the isoschizomer can be used to create the 
overhang having the desired sequence on the nucleic acid of interest. 

In another aspect of this embodiment, the cleavage with a methylation 
sensitive endonuclease results in a reaction product of the nucleic acid of interest that 
has a blunt end or an insufficient overhang. In this embodiment, an isoschizomer of 
15 the methylation sensitive restriction endonuclease can create a sufficient overhang on 
the nucleic acid of interest. "Blunt ends" refers to a flush ending of two stands, the 
sense stand and the antisense strand, of a nucleic acid. 

Once a sufficient overhang is created on the nucleic acid of interest, an 
oligonucleotide is ligated to the nucleic acid cleaved of interest which has been 

20 cleaved by the methylation specific restriction endonuclease. "Ligation" is the 
attachment of two nucleic acid sequences by base pairing of substantially 
complementary sequences and/or by the formation of covalent bonds between two 
nucleic acid sequences. Ln one aspect of the present invention, an "oligonucleotide" is 
a nucleic acid sequence of about 2 up to about 40 bases in length. It is presently 

25 preferred that the oligonucleotide is from about 15 to 35 bases in length. 

In one embodiment, an adaptor is utilized to create DNA ends of desired 
sequence and overhang. An "adaptor" is a double-stranded nucleic acid sequence 
with one end that has a sufficient single-stranded overhang at one or both ends such 
that the adaptor can be ligated by base-pairing to a sufficient overhang on a nucleic 
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acid of interest that has been cleaved by a methylation sensitive restriction enzyme or 
an isoschizomer of a methylation sensitive restriction enzyme. Adaptors can be 
obtained commercially, or two oligonucleotides can be utilized to form an adaptor 
Thus, in one embodiment, two oligonucleotides are used to form an adaptor; these 
5 oligonucleotides are substantially complementary over their entire sequence except for 
the region(s) at the 5' and/or 3' ends that will form a single stranded overhang. The 
single stranded overhang is complementary to an overhang on the nucleic acid cleaved 
by a methylation sensitive restriction enzyme or an isoschizomer of a methylation 
sensitive restriction enzyme, such that the overhang on the nucleic acid of interest will 
10 base pair with the 3* or 5* single stranded end of the adaptor under appropriate 

conditions. The conditions will vary depending on the sequence composition (GC vs 
AT), the length, and the type of nucleic acid (see Sambrook et al., Molecular Cloning: 
a Laboratory Manual , 2nd Ed.; Cold Spring Harbor Laboratory Press, Plainview, NY, 
1998). 

15 Following the ligation of the oligonucleotide, the nucleic acid of interest is 

amplified using a primer complementary to the oligonucleotide. Specifically, the term 
"primer" as used herein refers to a sequence comprising two or more deoxyribo- 
nucleotides or ribonucleotides, preferably more than three, and more preferably more 
than eight, wherein the sequence is capable of initiating synthesis of a primer 

20 extension product, which is substantially complementary to a nucleic acid such as an 
adaptor or a ligated oligonucleotide. Environmental conditions conducive to synthesis 
include the presence of nucleoside triphosphates and an agent for polymerization, such 
as DNA polymerase, and a suitable temperature and pH. The primer is preferably 
single stranded for maximum efficiency in amplification, but may be double stranded. 

25 If double stranded, the primer is first treated to separate its strands before being used 
to prepare extension products. In one embodiment, the primer is an oligodeoxyribo- 
nucleotide. The primer must be sufficiently long to prime the synthesis of extension 
products in the presence of the inducing agent for polymerization. The exact length of 
primer will depend on many factors, including temperature, buffer, and nucleotide 
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composition. The oligonucleotide primer typically contains 12-20 or more 
nucleotides, although it may contain fewer nucleotides: 

Primers of the invention are designed to be "substantially" complementary to 
each strand of the oligonucleotide to be amplified and include the appropriate G or C 
5 nucleotides as discussed above. This means that the primers must be sufficiently 
complementary to hybridize with their respective strands under conditions which 
allow the agent for polymerization to perform. In other words, the primers should 
have sufficient complementarity with a 5' and 3' oligonucleotide to hybridize 
therewith and permit amplification of CpG containing nucleic acid sequence. 

10 Primers of the invention are employed in the amplification process which is an 

enzymatic chain reaction that produces exponential quantities of target locus relative 
to the number of reaction steps involved (e.g., polymerase chain reaction or PCR). 
Typically, one primer is complementary to the negative (-) strand of the locus and the 
other is complementary to the positive (+) strand. Annealing the primers to denatured 

15 nucleic acid followed by extension with an enzyme, such as the large fragment of 
DNA Polymerase I (Klenow) and nucleotides, results in newly synthesized + and - 
strands containing the target locus sequence. Because these newly synthesized 
sequences are also templates, repeated cycles of denaturing, primer annealing, and 
extension results in exponential production of the region (i.e., the target locus 

20 sequence) defined by the primer. The product of the chain reaction is a discrete 
nucleic acid duplex with termini corresponding to the ends of the specific primers 
employed. 

The oligonucleotide primers of the invention may be prepared using any 
suitable method, such as conventional phosphotriester and phosphodiester methods or 
25 automated embodiments thereof. In one such automated embodiment, diethylphos- 
phoramidites are used as starting materials and may be synthesized as described by 
Beaucage, et al. (Tetrahedron Letters, 22:1859-1862, 1981). One method for 
synthesizing oligonucleotides on a modified solid support is described in U.S. Patent 
No. 4,458,066. 
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Where the CpG-containing nucleic acid sequence of interest contains two 
strands, it is necessary to separate the strands of the nucleic acid before it can be used 
as a template for the amplification process. Strand separation can be effected either as 
a separate step or simultaneously with the synthesis of the primer extension products. 
5 This strand separation can be accomplished using various suitable denaturing 

conditions, including physical, chemical, or enzymatic means, the word "denaturing" 
includes all such means. One physical method of separating nucleic acid strands 
involves heating the nucleic acid until it is denatured. Typical heat denaturation may 
involve temperatures ranging from about 80° to 105°C for times ranging from about 1 

10 to 10 minutes. Strand separation may also be induced by an enzyme from the class of 
enzymes known as helicases or by the enzyme RecA, which has helicase activity, and 
in the presence of riboATP, is known to denature DNA. The reaction conditions 
suitable for strand separation of nucleic acids with helicases are described by Kuhn 
Hoffmann-Berling (CSH-Quantitative Biology, 43:63, 1978) and techniques for using 

15 RecA are reviewed in C. Radding (Ann. Rev. Genetics, J_6:405-437, 1982). 

When complementary strands of nucleic acid or acids are separated, regardless 
of whether the nucleic acid was originally double or single stranded, the separated 
strands are ready to be used as a template for the synthesis of additional nucleic acid 
strands. This synthesis is performed under conditions allowing hybridization of 

20 primers to templates to occur. Generally synthesis occurs in a buffered aqueous 
solution, generally at a pH of about 7-9. Preferably, a molar excess (for genomic 
nucleic acid, usually about 108:1 primer: template) of the two oligonucleotide primers 
is added to the buffer containing the separated template strands. It is understood, 
however, that the amount of complementary strand may not be known if the process 

25 of the invention is used for diagnostic applications, so that the amount of primer 

relative to the amount of complementary strand cannot be determined with certainty. 
As a practical matter, however, the amount of primer added will generally be in molar 
excess over the amount of complementary strand (template) when the sequence to be 
amplified is contained in a mixture of complicated long-chain nucleic acid strands, a 

30 large molar excess is preferred to improve the efficiency of the process. 
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The deoxyribonucleoside triphosphates dATP, dCTP, dGTP, and dTTP are 
added to the synthesis mixture, either separately or together with the primers, in 
adequate amounts and the resulting solution is heated to about 90°-100°C from about 
1 to 10 minutes, preferably from 1 to 4 minutes. After this heating period, the 

5 solution is allowed to cool to approximately room temperature, which is preferable for 
the primer hybridization. To the cooled mixture is added an appropriate agent for 
effecting the primer extension reaction (called herein "agent for polymerization"), and 
the reaction is allowed to occur under conditions known in the art. The agent for 
polymerization may also be added together with the other reagents if it is heat stable. 

10 This synthesis (or amplification) reaction may occur at room temperature up to a 

temperature above which the agent for polymerization no longer functions. Thus, for 
example, if DNA polymerase is used as the agent, the temperature is generally no 
greater than about 40°C. Most conveniently the reaction occurs at room temperature. 

The agent for polymerization may be any compound or system which will 
function to accomplish the synthesis of primer extension products, including enzymes. 
Suitable enzymes for this purpose include, for example, E. coli DNA polymerase I, 
Klenow fragment of E. coli DNA polymerase I, T4 DNA polymerase, other available 
DNA polymerases, polymerase muteins, reverse transcriptase, and other enzymes, 
including heat-stable enzymes (i.e., those enzymes which perform primer extension 
after being subjected to temperatures sufficiently elevated to cause denaturation such 
as Taq DNA polymerase, and the like). Suitable enzymes will facilitate combination 
of the nucleotides in the proper manner to form the primer extension products which 
are complementary to each locus nucleic acid strand. Generally, the synthesis will be 
initiated at the 3* end of each primer and proceed in the 5' direction along the 
template strand, until synthesis terminates, producing molecules of different lengths. 
There may be agents for polymerization, however, which initiate synthesis at the 5' 
end and proceed in the other direction, using the same process as described above. 

Preferably, the method of amplifying is by PCR, as described herein and as is 
commonly used by those of ordinary skill in the art. However, alternative methods of 
30 amplification have been described and can also be employed. PCR techniques and 



20 
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many variations of PGR are known. Basic PCR techniques are described by Saiki et 
al. (1988 Science 239:487-491) and by U.S. Pat. Nos. 4,683,195, 4,683,202 and 
4,800,159, which are incorporated herein by reference. 

The conditions generally required for PCR include temperature, salt, cation, 
5 pH and related conditions needed for efficient copying of the master-cut fragment. 
PCR conditions include repeated cycles of heat denaturation (i.e. heating to at least 
about 95. degree. C.) and incubation at a temperature permitting primer: adaptor 
hybridization and copying of the master-cut DNA fragment by the amplification 
enzyme. Heat stable amplification enzymes like the pwo, Thermus aquaticus or 
10 Thermococcus litoralis DNA polymerases are commercially available which eliminate 
the need to add enzyme after each denaturation cycle. The salt, cation, pH and related 
factors needed for enzymatic amplification activity are available from commercial 
manufacturers of amplification enzymes. 

As provided herein an amplification enzyme is any enzyme which can be used 
15 for in vitro nucleic acid amplification, e.g. by the above-described procedures. Such 
amplification enzymes include pwo, Escherichia coli DNA polymerase I, Klenow 
fragment of E. coli DNA polymerase I, T4 DNA polymerase, T7 DNA polymerase, 
Thermus aquaticus (Taq) DNA polymerase, Thermococcus litoralis DNA polymerase, 
SP6 RNA polymerase, T7 RNA polymerase, T3 RNA polymerase, T4 polynucleotide 
20 kinase, Avian Myeloblastosis Virus reverse transcriptase, Moloney Murine Leukemia 
Virus reverse transcriptase, T4 DNA ligase, E. coli DNA ligase or Q.beta. replicase. 
Preferred amplification enzymes are the pwo and Taq polymerases. The pwo enzyme 
is especially preferred because of its fidelity in replicating DNA. 

25 Once amplified, the nucleic acid can be attached to a solid support, such as a 

membrane, and can be hybridized with any probe of interest, to detect any nucleic acid 
sequence. Several membranes are known to one of skill in the art for the adhesion of 
nucleic acid sequences. Specific non-limiting examples of these membranes include 
nitrocellulose (N1TROPURE) or other membranes used in for detection of gene 
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expression such as polyvinylchloride, diazotized paper and other commercially 
available membranes such as GENESCREEN, ZETAPROBE (Biorad), and NYTRAN 
Methods for attaching nucleic acids to these membranes are well known to one of skill 
in the art. Alternatively, screening can be done in a liquid phase. 

5 In nucleic acid hybridization reactions, the conditions used to achieve a 

particular level of stringency will vary, depending on the nature of the nucleic acids 
being hybridized. For example, the length, degree of complementarity, nucleotide 
sequence composition (e.g., GC v. AT content), and nucleic acid type (e.g., RNA v. 
DNA) of the hybridizing regions of the nucleic acids can be considered in selecting 

10 hybridization conditions. An additional consideration is whether one of the nucleic 
acids is immobilized, for example, on a filter. 

An example of progressively higher stringency conditions is as follows: 2 x 
SSC/0.1% SDS at about room temperature (hybridization conditions); 0.2 x 
SSC/0.1% SDS at about room temperature (low stringency conditions); 0.2 x 

15 SSC/0.1% SDS at about 42°C (moderate stringency conditions); and 0.1 x SSC at 
about 68°C (high stringency conditions). Washing can be carried out using only one 
of these conditions, e.g., high stringency conditions, or each of the conditions can be 
used, e.g., for 10-15 minutes each, in the order listed above, repeating any or all of the 
steps listed. However, as mentioned above, optimal conditions will vary, depending 

20 on the particular hybridization reaction involved, and can be determined empirically. 
In general, conditions of high stringency are used for the hybridization of the probe of 
interest. 

The probe of interest can be detectably labeled, for example, with a 
radioisotope, a fluorescent compound, a bioluminescent compound, a 
25 chemi luminescent compound, a metal chelator, or an enzyme. Those of ordinary skill 
in the art will know of other suitable labels for binding to the probe, or will be able to 
ascertain such, using routine experimentation. 
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In one embodiment, representational difference analysis (RDA, see Lisitsyn et 
al., Science 259:946-951, 1993, herein incorporated by reference) can be performed 
on CpG-containing nucleic acid following MCA. MCA utilizes kinetic and 
subtractive enrichment to purify restriction endonuclease fragments present in one 

5 population of nucleic acid fragments but not in another. Thus, RDA enables the 
identification of small differences between the sequences of two nucleic acid 
populations. RDA uses nucleic acid from one population as a "tester" and nucleic 
acid from a second population as a "driver" in order to clone probes for single copy 
sequences present in (or absent from) one of the two populations. In one embodiment, 

10 nucleic acid from a "normal" individual or sample, not having a disorder such as a 
cell-proliferative disorder is used as a "driver," and nucleic acid from an "affected" 
individual or sample, having the disorder such as a cell proliferative disorder is used 
as a "tester." In one embodiment, the nucleic acid used as a "tester" is isolated from 
an individual having a cell proliferative disorder such as low grade astrocytoma, 

15 anaplastic astrocytoma, glioblastoma, medulloblastoma, gastric cancer, colorectal 
cancer, colorectal adenoma, acute myelogenous leukemia, leukemia, lung cancer, 
renal cancer, breast cancer, prostate cancer, endometrial cancer and neuroblastoma. 
The nucleic acid used as a "driver" is thus normal astrocytes, normal glial cells, 
normal brain cells, normal gastric cells, normal colorectal cells, normal leukocytes, 

20 normal lung cells, normal kidney cells, normal breast cells, normal prostate cells, 

normal uterine cells, and normal neurons, respectively. In an additional embodiment, 
the nucleic acid used as a "driver" is isolated from an individual having a cell 
proliferative disorder such as low grade astrocytoma, anaplastic astrocytoma, 
glioblastoma, medulloblastoma, gastric cancer, colorectal cancer, colorectal adenoma, 

25 acute myelogenous leukemia, leukemia, lung cancer, renal cancer, breast cancer, 
prostate cancer, endometrial cancer and neuroblastoma. The nucleic acid used as a 
"tester" is thus normal astrocytes, normi; glial cells, normal brain cells, normal gastric 
cells, normal colorectal cells, normal leukocytes, normal lung cells, normal kidney 
cells, normal breast cells, normal prostate cells, normal uterine cells, and normal 

30 neurons, respectively. One of skill in the art will readily be able to identify the 



BNSDOCID. <WO 02068694A1 J_> 



WO 02/068694 



PCT/US02/05681 



34 

"tester" nucleic acid useful with to identify methylated nucleic acid sequences in 
given "driver" population. 

The materials for use in the assay of the invention are ideally suited for the 
preparation of a kit. Therefore, in accordance with another embodiment of the present 

5 invention, there is provided a kit it useful for the detection of a cellular proliferative 
disorder in a subject having or at risk for said cellular proliferative disorder. Invention 
kits include a carrier means compartmentalized to receive a sample in close 
confinement therein, one or more containers comprising a first container containing a 
reagent which modifies unmethylated cytosine and a second container containing 

10 primers for amplification of a CpG-containing nucleic acid, wherein the primers 
distinguish between modified methylated and nonmethylated nucleic acid, and 
optionally, a third container containing a methylation sensitive restriction 
endonuclease. Primers contemplated for use in accordance with the invention include 
those that would amplify sequences or fragments thereof as set 'forth in SEQ ED NOs: 

15 1-42. 

Carrier means are suited for containing one or more container means such as 
vials, tubes, and the like, each of the container means comprising one of the separate 
elements to be used in the method. In view of the description provided herein of 
invention methods, those of skill in the art can readily determine the apportionment of 

20 the necessary reagents among the containiner means. For example, one of the 

container means can comprise a container containing an oligonucleotide for ligation to 
nucleic acid cleaved by a methylation sensitive restriction endonuclease. One or more 
container means can also be included comprising a primer complementary to the 
oligonucleotide. In addition, one or more container means can also be included which 

25 comprise a methylation sensitive restriction endonuclease. One or more container 
means can also be included containing an isoschizomer of said methylation sensitive 
restriction enzyme. 

It should be noted that as used herein and in the appended claims, the singular 
forms "a," "and," and "the" include plural referents unless the context clearly dictates 
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otherwise. Thus, for example, reference to "a cell" includes a plurality of such cells 
and reference to "the restriction enzyme" includes reference to one or more restriction 
enzymes and equivalents thereof known to those skilled in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have 
5 the same meaning as commonly understood to one of ordinary skill in the art to which 
this invention belongs. Although any methods, devices and materials similar or 
equivalent to those described herein can be used in the practice or testing of the 
invention, the preferred methods, devices and materials are now described. 

All publications mentioned herein are incorporated herein by reference in full 
10 for the purpose of describing and disclosing the methodologies which are described in 
the publications which might be used in connection with the presently described 
invention. The publications discussed above and throughout the text are provided 
solely for their disclosure prior to the filing date of the present application. Nothing 
herein is to be construed as an admission that the inventors are not entitled to antedate 
15 such disclosure by virtue of prior invention. 

The following example is intended to illustrate but not to limit the invention in 
any manner, shape, or form, either explicitly or implicitly. While they are typical of 
those that might be used, other procedures, methodologies, or techniques known to 
those skilled in the art may alternatively be used. 

20 EXAMPLE 1 

Collection and Preparation of Pancreatic Cell Lines 

The following pancreatic adenocarcinoma cell lines were used: PL3 and PL8 
(both CIMP-cell lines (CpG island methylator phenotype) were from Dr. Elizabeth 
Jaffe, John Hopkins University); CAPAN1, CAPAN2, Panel , Hs766T, and MiaPaca2 
25 (from the American Type Culture Collection, Rockville, MD) and Colo357 (from the 
European Collection of Animal Cell Cultures, Salisbury, United Kingdom). In 
addition, seventeen pancreatic cancer xenografts were selected at random from a total 
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of 90 xenografts, which were established from the primary carcinomas as described in 
Ueki, et al., Cancer Res., 60:1835-1839, 2000. Forty-seven primary pancreatic 
adenocarcinomas, 15 normal pancreata, 5 pancreata from patients with chronic 
pancreatitis, and a panel of normal tissues were obtained from the resected surgical 

5 specimens at The Johns Hopkins Medical Institutions, Baltimore, MD. Frozen tissues 
or paraffin-embedded tissues were microdissected to obtain >40% neoplastic 
cellularity in the primary pancreatic adenocarcinomas, and 3 of the 15 frozen normal 
pancreatic tissues were also microdissected to enrich the normal ductal epithelium. 
DNA was extracted from microdissected primary pancreatic adenocarcinomas and 

10 normal tissues as well as from lymphocytes of four cancer-free individuals using 
standard methods. 

EXAMPLE 2 

Methylated CpG Island Amplification / Representational 
Difference Analysis (MCA/RDA) 

15 MCA/RDA was performed as described by Toyota et al., Cancer Res., 

59:2307-2312, 1999, modifying such procedure to increase the efficiency by digesting 
5\xg of DNA with Smal and Xmal (New England Biolabs). The restriction fragments 
were then ligated to RMCA adapter and amplified by PCR in 10 mM Tris-HCl (pH 
8.3), 1.5 mM MgC12, 50 mM KC1, 0.5 M betaine, 2% DMSO, 200 ^iM each 

20 deoxynucleotide triphosphate, 100 pmol of RMCA 24mer primer and 15 units of Taq 
polymerase (Life Technologies, Inc.) in a final reaction volume of 100 jal. The 
reaction mixture was then incubated at 72°C for 5 min and at 95°C for 3 min, and then 
subjected to 25 cycles of 1 min at 95°C and 3 min at 77°C followed by a final 
extension of 10 min at 77°C. Betaine was included in the PCR reaction to help 

25 amplify the methylated templates at a higher annealing temperature (77°C). The 
combination of betaine and DMSO can uniformly amplify a mixture of DNA with 
different GC content (Baskaran, et al., Genome Res. 6:633-638, 1996). These 
modifications might have enhanced the amplification of distinct MICPs (methylated 
in carcinoma of the pancreas) instead of Alu repetitive sequences that accounted for 

30 60% of the recovered clones using the original protocol (Toyota et al., Cancer Res. 
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59:2307-23 12, 1 999). The MCA amplicon from either the pancreatic cancer cell line 
PL3 or PL8 was used as the tester for RDA, and a MCA amplicon generated from a 
mixture of DNA from the normal pancreata of six different patients was used as the 
driver. RDA was performed on these MCA amplicons using different adapters, 
5 JMCA and NMCA. Sequences of adapters used for MCA/RDA are listed in Table 3 A 
and 3B and are available at http://pathology2.jhu.ediL/pancreas/prim0425.htm, which 
website is incorporated herein by reference. After the third round of competitive 
hybridization and selective amplification, the RDA difference products of second and 
third round amplifications were cloned into pBluescript II plasmid vector (Stratagene). 

10 EXAMPLE 3 

DNA Sequencing of Clones and Dot Blot Hybridization 

The clones recovered from each cell line after MCA/RDA were 
amplified with T3 and T7 primers and then sequenced using KS primer as 
recommended by the manufacturer (Sequitherm Excel; Epicentre Technologies). To 

15 determine the methylation status of MCA/RDA MICPs in pancreatic cancer and 
normal pancreas, MICPs were screened by hybridizing them to a dot blot of MCA 
products of pancreatic cancers and normal pancreata. Plasmid DNA containing each 
independent clone was prepared and digested with Smal. DNA fragments were 
recovered from agarose gel and used as a probe for dot blot hybridization. Aliquots 

20 ( 1 nl) of the mixture of 1 OXSSC and MCA products from the driver and from the 
tester (PL3 and PL8) both before and after each of the three rounds of RDA 
competitive hybridization/selective amplification were blotted onto nylon membranes 
in duplicate. Similarly, MCA products from six pancreatic cell lines (CAPAN1, 
CAPAN2, Panel, Hs766T, MiaPaca2, and Colo357) and from eight other normal 

25 pancreata were also blotted onto the membranes. The membranes were hybridized 
with 32P-labeled probes overnight, washed, and exposed to a Kodak X-ray film. 
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EXAMPLE 4 

Bisulfite Modification^ Bisulfite-Modified Genomic Sequencing, and 
Metbvlation-Specific PCR (MSP) 

The bisulfite treatment was carried out for 16 h at 50°C using l(ig of genomic 

5 DNA, as described in Ueki, et al., Cancer Res., 60:1835-1839, 2000. Genomic 

sequencing was performed on bisulfite-treated DNA to examine the methylation status 
of 10-20 CpG dinucleotides located in and/or around Smal sites of each clone in 22 
pancreatic tissues (8 cancer cell lines, 6 primary adenocarcinomas, and 8 normal 
pancreata; Ueki, et al., Cancer Res. 60:1835-1839, 2000). Genomic sequencing of the 

10 coding sequence of cyclin G was also performed in PL8. The level of methylation of 
each clone was determined by quantifying the level of methylation of each CpG site 
by comparing the intensity of unconverted cytosine with that of cytosine plus 
thymidine. Generally, in pancreatic cancer cell lines, the level of methylation 
observed at each CpG dinucleotide was consistent throughout the CpG island. 

15 Therefore, the average level of methylation of each sequence was graded and placed 
into one of 4 grades: 0-10%, 1 1-30%, 31-70%, and 71-100%. MSP was performed 
as described in Herman, et al., Proc. Natl. Acad. Sci. USA 93:9821-9826, 1996, and to 
acquire optimal specificity, each primer pair contained four to six CpG sites, and high 
specific annealing temperatures were used. The primers and the specific annealing 

20 temperatures for each clone are listed in Table 3 and are available at 

http://pathology2.jhu.edu/pancreas/prim0425.htm, which website is incorporated 
herein by reference. If validated MSP primers sets specific for methylated and 
unmethylated templates revealed that there was only amplification of methylated 
templates, the samples are presumed to be 100% methylated. Methylated and 

25 unmethylated templates were identified by bisulfite-modified sequencing. In 

describing MSP results performed on CpG islands that were normally unmethylated in 
non-neoplastic pancreas, a cancer sample was termed "methylated" if MSP yielded 
any methylated templates. 
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EXAMPLE 5 

Reverse Transcription-PCR (RT-PCR) and 
S-aza-Z'-deoxvcvtidine (5Aza-dC) Treatment 

Five pancreatic cancer cell lines (PL3, PL8, CAJPAN2, Panel, and MiaPaca2) 
5 and four normal pancreata were used for RT-PCR analysis. The cell lines were 
treated with demethylating agent 5Aza-dC (Sigma Chemical Co.) at a final 
concentration of 1 for 5 days. Total RNA was prepared using TRIzol (Life 
Technologies, Inc.), reverse-transcribed and amplified. As a control for cDNA 
integrity, glyceraldehyde-3-phosphate dehydrogenase gene (GAPDH) was also 
10 amplified. Primer sequences for RT-PCR are listed in Table 3 and are available at 
http://pathology2.jhu.edu/pancreas/prim0425.htm, which website is incorporated 
herein by reference. 

EXAMPLE 6 
Statistics 

15 The primary outcome variable was the observed number of 7 MICP loci found 

to be methylated in 64 pancreatic cancers. Wilcoxon's rank-sum test compared the 
observed number of methylated loci by tumor differentiation (poorly versus well or 
moderately differentiated), lymph node status (0 or 1 versus >1 node positive), and 
prior CIMP classification (CIMP positive versus CIMP negative). Simple linear 

20 regression assessed the relationship between the observed number of methylated loci 
and these covariates: age, age squared, and tumor diameter (in cm). Multivariate 
linear regression assessed the simultaneous contribution of the clinicopathological and 
demographic variables to the observed number of methylated loci. All of the tests 
were two-sided. A P of < 0.05 signified statistical significance. 
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EXAMPLE 7 

Patient Population and Tissue Samples 

Normal and tumor specimens were obtained from pancreatic adenocarcinomas 
resected at The Johns Hopkins Hospital. Pancreatic cancer xenografts were 

5 established from the primary carcinomas as previously described in Caldas, et al., 

Nature Genet. 8:27-31, 1994, and carcinoma and normal tissues were stored at -70°C. 
Thirty-two xenografts were selected at random. Three MSI carcinoma xenografts 
were added, as reported previously, as well as another MSI primary carcinoma 
(Goggins, et al., Am. J. of Pathology 152:1501-1507, 1998). Genomic DNA was 

10 prepared from 35 xenografts and 9 pancreatic adenocarcinoma cell lines (BxPc3, 
Capanl, Capan2, Panel, CFPAC1, MiaPaca2, Hs766T (all from ATCC, Rockville, 
MD), Col0357 (from ECACC, Salisbury, UK) and PL45, a low passage cell line 
established in the Goggin Laboratory at John Hopkins University School of 
Medicine). Where available, DNA was obtained from primary pancreatic cancer 

15 tissue frozen at -80C since Whipple resection. Frozen tissues were microdissected to 
obtain DNA from normal pancreatic tissue and the pancreatic carcinoma. 

Patient records were reviewed to determine a history of cigarette smoking, 
diabetes mellitus, and prognosis, and a family history of pancreatic cancer. These data 
were reviewed in the context of the DNA methylation data. 

20 EXAMPLE 8 

Bisulfite Modification and Genomic Sequencing 

The bisulfite treatment was carried out for 16h at 56°C on 1 jag of genomic 
DNA, according to the procedure of Herman et al, Proc. Natl. Acad. Sci. USA 
93:9821-9826, 1996. Modified DNA was purified and eluted into 50\±\ of LoTE 
25 buffer. The primers used for genomic sequencing of bisulfite-treated DNA are listed 
in Table 3 and are available at http://pathology2.jhu.edu/pancreas/prim0425.htm, 
which website is incorporated herein by reference. PCR was performed on l-2|j,l of 
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bisulfite-treated DNA, and prior to sequencing, PCR reactions were incubated with 
exonuclease I and shrimp alkaline phosphatase (Amersham, according to the 
manufacturer recommendations). Sequencing of PCR products was performed in 
microtiter plates as recommended by the manufacturer (Sequitherm Excel, Epicentre 
5 Technologies, Madison, WI). 

EXAMPLE 9 

Methvlation-Specific PCR (MSP) Assay 

The methylation status of each gene was determined by MSP as described by 
Herman et al, Proc. Natl. Acad. Sci. USA 93:9821-9826, 1996, in which the sequence 

10 difference of bisulfite-treated DNA was detected by PCR using primers specific for 
either the methylated or for the unmethylated DNA. Primer sequences and the 
specific annealing temperatures for 13 genes are listed in Table 3 and are available at 
http://pathology2.jhu.edu/pancreas/prim0425.htm, which website is incorporated 
herein by reference. MSP was performed on ljil of bisulfite-treated DNA under the 

15 conditions as follows: 95°C for 3 min; then 35-40 cycles of 95°C for 30s, the specific 
annealing temperature for 30s, and 72°C for 30s; and a final extension of 4 min at 
72°C. Five |al of each PCR product were directly loaded onto 3% agarose gels or 10% 
acrylamide gels, stained with ethidium bromide and visualized under UV illumination. 
All PCR reactions were performed with positive controls for both unmethylated and 

20 methylated alleles, and no DNA control. Finally, 3-6 CpG sites were included in each 
primer pair and the specific annealing temperatures were used for each gene to obtain 
optimal specificity. 

EXAMPLE 10 

Identification of differentially methylated sequences 

25 The strategy of MCA/RDA has been previously reported (Toyota, et aL, 

Cancer Res. 59:2307-2312, 1997). Ninety-six randomly selected clones recovered 
from each cell line were subjected to DNA sequencing and 66 clones were revealed to 
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be independent. Only 3 clones contained Alu-repetitive sequences. The subsequent 
probing of labeled clones to MCA products of tester and driver by dot-blot 
hybridization revealed that 43 of 66 clones (MICP1-43) were differentially methylated 
in the tester compared to the driver (Fig, 1). These 43 clones were also variably 

5 methylated in the other 6 pancreatic cancer cell lines examined. A description of 

these 43 differentially methylated clones is shown in Table 1 . All of the 43 clones had 
a GC content of greater than 50% and 41 (95%) had a sequence uniqueness sustaining 
the criteria of CpG island (Gardiner-Garden & Frommer, J. Mol. biol. 196:261-282, 
1987). The DNA homology search of each clone with the BLAST program (National 

10 Center for Biotechnology Information) demonstrated that 84% (36) of the 43 clones 
had significant homologies to known human sequences, including 8 clones matched to 
human gene sequences and 9 clones matched to human ESTs. Five clones were also 
matched or contained a part of CpG islands isolated previously (Toyota, et al., Cancer 
Res. 59:2307-2312, 1997, Cross, et al., Nat. Genet., 6:236-244, 1994) and 14 clones 

15 had significant homology to High-Throughput Genome Sequences in the three 
International Nucleotide Sequence databases: DDBJ, EMBL and GenBank. The 
remaining 7 had no significant homology to known sequences. MICP1 corresponded 
to the 3' noncoding region of the human homeobox gene CSX, MICP2 corresponded 
to exon 1 and intron 1 of the human Cyclin G, MICP3 corresponded to 5 7 region of 

20 the human endothelin-converting enzyme-like 1 gene ECEL1 , MICP4 corresponded to 
intron 1 of the human glutamate decarboxylase 1 gene GAD1, MICP5 corresponded 
to exon 7 to exon 9 of the human ICAM5, MICP6 corresponded to intron 2 to intron 3 
of the human monocarboxylate transporter 3 gene MCT3, MICP7 corresponded to the 
5' region of the human B-cell specific transcriptional factor gene PAX5, MICP8 

25 corresponded to intron 1 and exon 2 of the human Preproenkephalin gene (ppENK) 
(Fig. 2). Interestingly, 3 clones (MICP1, 17 and 22) matched to CpG islands originally 
recovered from colorectal cancer cell line using the same technique (named MINT 23, 
20 and 32, respectively) (Toyota, et al., Cancer Res. 59:2307-2312, 1997). 
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EXAMPLE 1 1 

Characterization of the sequences 

For 30 of the 43 clones, methylation was detected in 2 or more out of 8 normal 
pancreata by dot blot analysis, suggesting that these clones could be frequently 

5 methylated in normal pancreas. Therefore, only the remaining 13 clones were further 
analyzed by bisulfite sequencing. For seven of the clones (Cyclin G, ppENK, 
MICP20, 23, 33, 35 and 36), methylation was restricted to pancreatic cancers (Fig. 4A 
and 4B). In the case of 5 clones (MCT3, PAX5, MICP15, 16 and 38), methylation 
was detectable in DNA from normal pancreata as well as DNA from cancer tissues 

10 (Fig. 3C). In the thirteenth clone (MICP42), cytosines at CpG sites were similarly 
methylated in both pancreatic cancers and normal pancreata. The sequence 
uniqueness of this clone also did not satisfy the criteria for CpG islands. A summary 
of the level of methylation of each clone is shown in Fig. 3D. There was little 
individual variation in the level of methylation of these 6 clones (MCT3, PAX5, 

15 MICP15, 16, 38 and 42) in normal pancreata from 8 patients at the age from 34 to 84 
years old (Fig. 3 and data not shown). Interestingly, the methylation of these CpG 
islands in normal pancreata was sometimes heterogeneous. For example, although the 
Smal site of MICP15 was not methylated in normal pancreata, 2 CpG sites near the 
Smal site were methylated (Fig. 3C). A summary of the results of MCA/RDA and 

20 bisulfite sequencing is also provided in Fig. 4. 

In order to identify low level of methylation of identified clones in pancreatic 
tissues, MSP primers were designed for the 7 clones (Cyclin G, ppENK, MICP20, 23, 
33, 35 and 36) differentially methylated in pancreatic cancers by bisulfite sequencing. 
The methylation status of these clones was examined in 1 5 primary pancreatic 
25 adenocarcinomas, 5 DNA samples from pancreata with chronic pancreatitis as well as 
14 normal pancreata including three specimens enriched in normal ductal epithelium. 
For MICP36, MSP detected methylation in DNA of three pancreatic cancer cell lines 
but not in any other specimens examined. Aberrant methylation of the remaining 6 
clones was detected in 7% to 87% of primary pancreatic adenocarcinomas, while MSP 
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confirmed a lack of methylation of these 6 clones in DNA from 14 normal pancreata 
(Fig. 5 and Table 2A and 2B). DNA from one chronic pancreatitis tissue containing 
pancreatic intraductal neoplasia (PanIN) (Hruban, et ah, Clin. Cancer Res. 6:2969- 
2972, 2000) harbored methylated templates of Cyclin G, ppENK and MICP20 and 
5 DNA from another chronic pancreatitis sample also had methylated templates of 
ppENK. 

Because tissue-specific methylation differences can be found in normal tissues 
(Liang, et al., Genomics 53:260-268, 1998), the clones found that were aberrantly 
methylated in pancreatic cancer, as compared to normal pancreas, were also analyzed 

10 to see if they are methylated in other normal tissues. By MSP, MICPs 33 and 35 were 
not methylated in any normal gastrointestinal tissues, while ppENK , MICP20 and 23 
were methylated in DNA samples from normal gastric, duodenal and colonic mucosae 
(Fig. 5 and Table 2A and 2B). Amplification of methylated templates of these clones 
was always weak in normal mucosae compared to the primary pancreatic 

15 adenocarcinomas (Fig. 3), suggesting that there were few methylated DNA templates 
in these mucosae. 

EXAMPLE 12 

Expression of Cyclin G and ppENK in pancreatic cancer 

To determine whether methylation of these clones resulted in loss or decrease 
20 of gene expression, Cyclin G and ppENK were further examined using RT-PCR in 5 
and 4 pancreatic cell lines, respectively. Partial methylation (-50%) of the 5' CpG 
island of Cyclin G in PL8 (Fig. 4C) was associated with decreased expression of 
Cyclin G by RT-PCR. The 5' CpG island of Cyclin G was not methylated in a panel 
of normal pancreata. Cyclin G was expressed in 4 normal pancreata by RT-PCR. 
25 Treatment with 5Aza-dC restored the expression of Cyclin G in PL8 (Fig 4C). Using 
RT-PCR, it was found that ppENK was expressed in normal pancreata but not in any 
of the 4 pancreatic cancer cell lines examined in which 5'CpG island of this gene was 
methylated. 5Aza-dC treatment restored ppENK expression in all 4 cell lines (Fig. 
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4D). Thus, hypermethylation of the 5' CpG islands of Cyclin G and ppENK is 
coincides with decreased expression in pancreatic cancers. 

EXAMPLE 13 

Determination of Methylated ppENK by Methylation Specific PCR of Pancreatic 
5 Fluid as a Specific Marker of Pancreatic Cancer 

Pancreatic fluid obtained from patients undergoing pancreatic surgical 
resection from 38 patients with pancreatic cancer, 7 with chronic pancreatitis, 4 with 
miscellaneous tumors (islet cell, serous cyst adenoma, and lymphoma), 9 with 
ampullary cancers, 9 with IPMNs (intraductal papillary mucinous neoplasms) and 3 
10 from bile duct cancers was analyzed. Methylated ppENK was detected in 55% of 

pancreatic cancer fluids, none in the chronic pancreatitis or other miscellaneous tumor 
group, 2 of 9 with EPMN, 2 of 3 with bile duct cancer, and 4 of 9 with ampullary 
cancer, suggesting that the specificity of methylated ppENK in pancreatic fluid is 
useful as a diagnostic marker. 

15 Although the invention has been described with reference to the presently 

preferred embodiment, it should be understood that various modifications can be 
made without departing from the spirit of the invention. Accordingly, the invention is 
limited only by the following claims. 
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TABLE 1 A 



The 42 MICPs identified by MCA/RDA in pancreatic cancers* 



MICP 


Blast homology 


Size (bp) 


CpG Islands** 


Chromosome 


1 


CSX 


414 


yes 


5q35 


2 


Cyclin G 


392 


yes 


5q32-34 


3 


FU00083 


479 


no 


not known 


4 


GAD] 


309 


yes 


2 


5 


HLH 


179 


yes 


20 


6 


I CAM 5 


630 


yes 


I9pl3.2 


7 


MCT3 


308 


yes 


22ql2.3-q31.2 


S 


PAX5 


548 


yes 


9pl3 


9 


ppENK 


255 


yes 


8q23-q24 


10 


SMO 


229 


yes 


7q34-q36 


11 


ZBP 


399 


yes 


5q22 


12 


Human EST 


431 


yes 


Xp22 


13 


Human EST 


600 


yes 


3p 


14 


Human EST 


510 


yes 


5 


15 


Human EST 


359 


yes 


not known 


16 


Human EST 


370 


yes 


12q 


17 


Human EST 


324 


yes 


6q 


18 


Human EST 


415 


yes 


Uq 


19 


Human EST 


392 


yes 


not known 


20 


Human EST 


359 


yes 


llql4 


21 


Human EST 


545 


yes 


19ql3.3 


22 


Human CpG island 


386 


yes 


7q34-q36 


23 


Human CpG island 


464 


yes 


20ql3 


24 


Human CpG island 


526 


yes 


15 


25 


Human CpG island 


280 


yes 


6p22.1-23 


26 


Human CpG island 


422 


yes 


1 


27 


Human genomic DNA 


257 


yes 


17 


28 


Human genomic DNA 


350 


yes 


7q34-q36 


29 


Human genomic DNA 


336 


yes 


10 


30 


Human genomic DNA 


293 


yes 


2q36-q37 


31 


Human genomic DNA 


427 


yes 


13 


32 


Human genomic DNA 


470 


no 


21q22.3 


33 


Human genomic DNA 


264 


yes 


Xq22.1-23 


34 


Human genomic DNA 


371 


yes 


20pl 1.21-1 1.23 


35 


Human genomic DNA 


445 


yes 


4 


36 


Human genomic DNA 


357 


yes 


19ql3.1 


37 


Human genomic DNA 


312 


yes 


7pl4-15 


38 


Human genomic DNA 


372 


yes 


5 


39 


No homology 


307 


yes 


not known 


40 


No homology 


331 


yes 


not known 


41 


No homology 


304 


yes 


not known 


42 


No homology 


307 


yes 


not known 



* 7 of these 42 MICPs were methylated in pancreatic cancer and not normal pancreas (soc text) 



*• minimal length. 2O0Bp; GC content, >50%; CpO/GpC. X).5(11) 
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TABLE IB 



MTCP* 


Size 


%GC 


CpG/GpC 


CpG lolauda? 


NCBX Acc. #** 


location of clone*** 


1 


414 


64 .3 


0 . 811 


yes 


AF135523 


1-414 


2 


392 


62 . 8 


0 .613 


yes 


D86077 


35-426 


3 


479 


63 .9 


0.444 


no 


AK024484 


1645-2107 


4 


309 


64 .4 


0 . 727 


yes 


AC007405 


72554 -72862 


5 


179 


68.3 


1 .330 


yes 


AL121673 


66270-66448 


6 


630 


65.6 


0 . 625 


yes 


AF082802 


5502-6131 


7 


308 


66.6 


0 .704 


yes 


AF132611 


1082-1389 


8 


548 


60.4 


0 .683 


yes 


AI>161781 


129660-130226 


9 


255 


67 . 8 


0 . 867 


yes 


V00509 


622-876 


10 


229 


58.2 


0.927 


yes 


AC083866 


37615-37843 


11 


399 


59.1 


0 .800 


yes 


AC016663 


106325-106723 


12 


431 


68 .4 


0 . 909 


yes 


AC002365 


159360-159790 


13 


600 


65.5 


0 .661 


yes 


AC024X67 


62307-62906 


14 


510 


63.7 


0.771 


yes 


AF135520 


1-510 


15 


359 


65 . 5 


0 . 972 


yes 


AW028607 


259-376 


16 


370 


64 . 9 


1 .030 


yes 


AC026336 


11182'-11568 


17 


321 


67 . 6 


0.816 


yes 


AC068247 


66045-66347 


10 


415 


61.9 


0 .767 


yes 


AP002781 


190243-190537 


19 


392 


55. 1 


1.214 


yes 


AI557774 


1 93-392 


20 


359 


55 . 6 


0.731 


yes 


AP002354 


58648-59006 


21 


545 


65 . 0 


1 . 023 


yes 


AC007785 


13354-13898 


22 


386 


66 _ 3 


0 .605 


yes 


AC005998 


48200-48585 


23 


464 


65 .7 


0 . 975 


yes 


AF135532 


1-464 


24 


526 


64 . 2 


0.645 


yes 


AC074201 


59279-59804 


25 


280 


68 . 1 


0 . 929 


yes 


AL136305 


115130-115409 


26 


422 


63 . 0 


0 . 767 


yes 


Ai.137797 . 1 


38754-3918.6 


27 


257 


65 . 8 


0 .«B52 


yes 


AC004706 


80068-89124 


28 


350 


64 .9 


0 . 900 


yes 


AC004897 


60876-61125 


29 


336 


64 .4 


0 . 606 


yes 


AC005661 


43755-44090 


30 


293 


67 .0 


0 . 760 


yes 


AC012O33 


1 13181-113473 


31 


427 


66.7 


0.884 


yes 


AC013721 


31973-32339 


32 


470 


59.6 


0 .292 


no 


AJ239326 


83971 r 84440. 


33 


264 


63 . 3 


0.580 


yes 


A1.035O88 


102596-102859 


34 


371 


63 .0 


0.722 


yes 


AL080312 


56645-57015 


35 


445 


64 .0 


0 . 884 


yes 


AC023868 


130032-030491 


36 


357 


68 .9 


0 . 800 


yes 


AC002133 


31977-32333 


37 


312 


67 .5 


0. 885 


yes 


AC005826 


151694 -152005 


38 


372 


65 .6 


0 . 778 


yes 


AC011376 


56983-57313 


39 


307 


68.7 


. 0 . 844 


yes 


no homology 




40 


331 


70 .7 


0 . 707 


yes 


no homology 




41 


304 


68 . 8 


0 .794 


yes 


no homology 




42 


307 


68 .7 


0 . 875 


yes 


r> o homo 1 ogry 





* Total 4 clones with no homology to known human sequences (MICP38-42) and 38 clones matched t 
1 1 clones matched to the human genes (MICP1-1 1) 
10 clones matched to human ESTs (MICP 12-21) 
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TABLE 2B 
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TABLE 3A 



PRIMER SEQUENCE FOR BISULFITE-SEQUENCE 



Clone 


Orientatio 


Sequence 


Annealinc 


bbU ID 




n 




Temperature 


NO: 


RARB 


Forward 
Reverse 
Sequence 


5 *-G AGTTGGTGATGTT AG ATTAG-3 ' 
5 '-TTCCCAAAAAAATCCCAAATTC-3 ' 
5 * -CTCCTTCC AAAT AAAT ACTT AC-3 * 


56 


43 
44 
45 


THBS1 


Forward 
Reverse 
Sequence 


5 -AGAGAGGAGTTTAGATT GG-3 
,5 * -CAAAAAAACT AAAACCTC AAC-3 * 
Forward primer 


54 


A A. 

4o 
47 


CACNA1G 


Forward 
Reverse 
Sequence 


5 -TGGATAAAGG ATGTTTGGGGTTTG-3 
5 '-CCCTCCCCTTACCCCTAA ATCC-3 * 
5 1 - ACTCCCCCTCACTTTATTC-3 1 


55,53,51,49* 


48 
49 
50 


hMLHl 


Forward 
Reverse 
Sequence 


3 -A I I A 1 1 1 1 AO I AuAuu 1 Al Al /vAu-j 

5 '-CCAACCCCACCCTTC A AC-3 ' 
Forward primer 


58 


< i 
52 


MINT1 


Forward 
Reverse 
Sequence 


5 '-AAGAGAGGGTTGG AG AGTAG-3 ' 
5 , -CCCCTAAAAAAAAAATCAAAAATC-3* 
5 '-GGGTTGG AGAGTAGGGG AGTT-3 * 


62 


53 
54 
55 


MINT2 


Forward 
Reverse 
Sequence 


5 *- YGTTATGATTTTTTTGTTTAGTTAAT-3 ' 
5 '-TACACC AACTACCCAACTACCTC-3 ' 
5 ACTTCCATTAAAAACAACTAC-3 1 


60, 58, 56, 54** 


56 
57 


MINT31 


Forward 
Reverse 
Sequence 


5 '-TTTATTTATATAATTTTGTGTATGG-3 * 
5 '-CACCCCTC ACTTTACTAAAAC-3 ' 
Reverse primer 


58 


58 
59 


MINT32 


Forward 
Reverse 
Sequence 


5 '-TTTGGGAGGTAAATTYGTTGATT-3 ' 
5 '-ACCRAACAAAAAACCTAAAAAAAC-3 ' 
Forward primer 


58, 56, 54, 52*** 


60 
61 



* 55 (5 cycles), 53 (5 cycles), 51 (5 cycles), 49 (26 cycles) 
5 ** 60 (3 cycles), 58 (4 cycles), 56 (5 cycles), 54 (26 cycles) 
*** 58 (3 cycles), 56 (4 cycles), 54 (5 cycles), 52 (26 cycles) 
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TABLE 3B- PRIMER SEQUENCES FOR MSP 



Clone 


Orientation & 


Sequence 


Annealing 


5EQ 




Methvlation 






Temp. 


[D 










MO: 


P16 


Unmethylated 


F 
R 


5 * -TT ATT AG AG G GTGGGGTG G ATTGT- 3 ' 
5 '-CAACCCCAAACCCACAACCATAA-3 ' 


60 


52 




Methylated 


F 
R 


5 '-TTATTAG AGGGTGGGGCGGATCGC-3 ' 
5 '-GACCCCCGAACCGCG ACCCTAA-3 ' 


65 


53 


RARB 


Unmethylated 


F 
R 


5 '-AGGATTGGGATGTTG AG AATG-3 ' 
5 '-TTACAAAAAACCTTCCAAATACA-3 ' 


58 


54 




Methylated 


F 
R 


5 '-GG ATTGGG ATGTCG AG AAC-3 1 

5 '-TACAAAAAACCTTCCG A ATACG-3 ' 


64 


65 


CACNA1G 


Unmethylated 


F 
R 


5 '-GTTTTTTTTTGG ATTTTTGTTTTTTG-3 ' 
5 ' -TTT ATTCC A A CTTCTTC A CTTC A - 3 ' 


60 


66 




Methylated 


F 
R 


5 '-GTTTTTTCGGGGCGGTTTC-3 * 
5 '-TTCCG ACTTCTTCGCTTCG-3 * % 


62 


67 


TIMP-3 


Unmethylated 


F 
R 


y -T | T f GT 1 T TG TT ATTTTTTGTTTTT GGTTTT-3 
5 ' -CCCCCC AAAAACCCCACCTCA-3 ' 


59 


68 




Methylated 


F 
R 


5 '-CGTTTCGTTATITTTTGIT'fTCGGTTTTCO ' 
5 '-CCG AAAACCCCGCCTCG-3 ' 


59 


69 


THBS1 


Unmethylated 


F 
R 


5 '-GTTTGGTTGTTGTTTATTGGTTG^ 1 
5 '-CCTAAACTCACAAACCAACTCA-3 ' 


62 


70 
71 




Methylated 


F 
R 


5 '-TGCG AGCGTTTTTTTA AATGC-3 ' 
5 *-TA AACTCGCAAACCAACTCG-3 ' 


62 


72 
73 


HMLH1 


Unmethylated 


F 
R 


5'*TTAATAGGAAGAGTGGATAGTG-3* 
5 T -TCTATAAATTACTAAATCTCTTCA-3 ' 


56 


74 
75 




Methylated 


F 


5 '-TTAATAGGAAGAGCGG ATAGC-3 * 


58 


76 




R 


3 '-CTATAAATTACTAAATCTCTTCG-3 * 




77 


E-Cad 


Unmethylated 


F 
R 


5 '-TAATTTTAGGTTAG AGGGTTATTGT-3 1 
5 '-CACAACCAATC AACAACACA-3 ' 


53 


78 
79 




Methylated 


F 
R 


S'-TTAGGTTAGAGGGTTATCGCGTO' 
5 '-TAACTAAAAATTCACCTACCGAC-3 1 


57 


80 
81 


DAPK 


Unmethylated 


F 
R 


5 '-GG AGG ATAGTTGG ATTG AGTTAATGTT-3 ' 
5 '-C AAATCCCTCCCAAACACCAA-3 1 


60 


82 
83 




Methylated 


F 
R 


5 '-GG ATAGTCGGATCG AGTTAACGTC-3 * 
5 '-CCCTCCCAAACGCCGA-3 ' 


60 


84 
85 


MGMT 


Unmethylated 


F 
R 


5 '-TTTGTGTTTTGATGTTTGTAGGTTTTTGT-3 ' 
5 *- AACTCC ACACTCTTCCAAAAACAAAACA-3 ' 


59 


86 
87 




Methylated 


F 
R 


5 '-TTTCG ACGTTCGTACCTTTTCGC-3 ' 
5 '-GCACTCTTCCGAAAACG AAACG-3 ' 


59 


88 
89 


M1NT1 


Unmethylated 


F 
R 


V-GGGGTI GAGG TnTnGlTAGT-3' 
5 '-TTCACAACCTCAAATCTACTTCA-3 ' 


64 


90 
91 




Methylated 


F 
R 


S'-GGGTTGAGGriTTTTGTTAGCO* 
S'-CTACTTCGCCTAACCTAACGO' 


64 


92 
93 


MFNT2 


Unmethylated 


F 
R 


5 '-GGTGTTGTTAAATGTAAATAATTTG-3 ' 
5 ' - AAAAAAAAAC ACCT AAAACTC A-3 ' 


58 


94 
95 




Methylated 


F 
R 


5 *- AATCG AATTTGTCGTCGTTTC-3 * 

5 ' -AAATAA ATAAATAAAAAA A AACGCG-3 * 


60 


96 
97 


MINT31 


Unmethylated 


F 
R 


5 '-GAATTGAGATGATTTTAAT7TTTTGT-3 * 
5 '-CTAAAACCATCACCCCTAAACA-3 ' 


64 


98 
99 




Methylated 


F 
R 


5 ^TTG AG ACGA1TTTAATTTTTTGC-3 ' 
5 *- AAAACC ATCACCCCT AAACG-3 * 


62 


100 
101 
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MINT32 


Unmethylated 


F 


5 AGTGGTTAG AGG AATTTAGGT-3 ' 


62 


102 




R 


5 '-CTAAAAAAACA AAC AAAAC ATCCA-3 ' 




103 




Methylated 


F 


5 '-GTGGTTAGAGG AATTTAGGC-3 ' 


64 


104 




R 


5 AAAACG AACG AAACGTCCG-3 * 




105 



BNSDOCIO: <WO 02068694A 1 Jj> 



WO 02/068694 



PCT/US02/05681 



53 

What is claimed is: 

1 . An isolated nucleic acid molecule selected from SEQ ED NO: 1-42, and regulatory 
sequences associated therewith. 

5 2. The nucleic acid molecule of claim 1, wherein said associated regulatory 
sequences contain CpG-rich regions. 

3. The nucleic acid molecule of claim 2, wherein the state of methylation of the 
CpG-rich regions is determinative of the presence of a cellular proliferative 

10 disorder in a subject from which the nucleic acid molecule is isolated. 

4. The nucleic acid molecule of claim 2, wherein hypermethylation of said CpG 
islands is indicative of the presence of a cellular proliferative disorder in a subject 
from which said nucleic acid is isolated. 

15 

5. The nucleic acid molecule of claim 1, wherein the molecule is selected from SEQ 
I D NO:39-42. 

6. A substantially purified polypeptide encoded by a polynucleotide selected 
20 from SEQ ID NO:39-42. 
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7. A method for detecting a cellular proliferative disorder in a subject 
comprising: 

a) contacting a nucleic acid-containing specimen from the subject with an agent that 
provides a determination of the methylation state of at least one gene or associated 
5 regulatory region of the gene selected from MICP1-42 of Table 1 and 

combinations thereof; and 

b) identifying aberrant methylation of regions of the gene or regulatory region, 
wherein aberrant methylation is identified as being different when compared to 
the same regions of the gene or associated regulatory region in a subject not 
10 having said cellular proliferative, thereby detecting a cellular proliferative 

disorder in the subject. 

8. The method of claim 7, wherein the regions of said gene are contained within 
CpG rich regions. 

15 

9. The method of claim 7, wherein the gene is SEQ ID NO:39, 40, 41 or 42. 

10. The method of claim 7, wherein aberrant methylation comprises hypermethylation 
when compared to the same regions of the gene or associated regulatory regions in 

20 a subject not having the cellular proliferative disorder. 

1 1 . The method of claim 10, wherein the regions comprise regulatory regions of the 
gene. 

25 12. The method of claim 7, wherein the agent is a pair of primers that hybridize with a 
target sequence in the gene or associated regulatory region of the gene. 

13. The method of claim 7, wherein the nucleic acid-containing specimen comprises a 
tissue selected from the group consisting of brain, colon, urogenital, lung, renal, 
30 prostate, pancreas, liver, esophagus, stomach, hematopoietic, breast, thymus, 

testis, ovarian, and uterine. 
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14. The method of claim 7, wherein the nucleic acid-containing specimen is selected 
from the group consisting of serum, urine, saliva, blood, duodenal fluid, pancreatic 
fluid, cerebrospinal fluid, pleural fluid, ascites fluid, sputum, stool, and biopsy 

5 sample. 

15. The method of claim 1 1, wherein said cellular proliferative disorder is selected 
from the group consisting of low grade astrocytoma, anaplastic astrocytoma, 
glioblastoma, medulloblastoma, gastric cancer, colorectal cancer, colorectal 

10 adenoma, acute myelogenous leukemia, lung cancer, renal cancer, leukemia, 

breast cancer, prostate cancer, endometrial cancer and neuroblastoma. 

16. A kit useful for the detection of a cellular proliferative disorder in a subject 
comprising: 

1 5 a) carrier means compartmentalized to receive a sample therein; 

b) one or more containers comprising a first container containing a reagent which 
modifies unmethylated cytosine and a second container containing primers for 
amplification of a CpG-containing nucleic acid, wherein the primer hybridizes 
with a target polynucleotide sequence having the sequence selected from SEQ 

20 ID NO: 1-42. 

17. The kit of claim 16, further comprising a third container containing a methylation 
sensitive restriction endonuclease. 

25 18. The kit of claim 16, wherein said modifying reagent is bisulfite. 

19. The kit of claim 16, wherein the primer hybridizes with a target polynucleotide 
sequence having a sequence as set forth in SEQ ID NO:39, 40, 41 or 42. 
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20. Isolated oligonucleotide primer(s) for detection of a methylated CpG-containing 
nucleic acid wherein the primer hybridizes with a target polynucleotide sequence 
having the sequence selected from the group consisting of SEQ ID NO: 1-42. 

5 21. Isolated oligonucleotide primer pairs selected from SEQ ID NO:43-105. 



BNSDOCID: <WO 02068694A1 J_> 



WO (12/(168694 



PCT/US02/0568I 



1/14 



FIGURE 1 




B 



PpENK 
MICPI.V 
MICP36 
MICP27 
MICP28 - 
MICP29 



BNSDOCID: <WO 02068694A1 J_> 



WO 02/068694 PCT/US02/05681 



2/14 



FIGURE 2 



CLONE 2 
< > 

Exon 1 Exon 2 

1 I MINIM! 1 1 1 1 1 11! I I 

m—m 300 bp 

MSP primers 1 1 



CLONE 8 

< > 

Exon I Exon 2 

ppENK — I 

II II 111 illlil I III I IIIIBIllllillHIIIRB I 

• 300 bp 

MSP primers x ' 



BNSDOCID: <WO 0206869 4A1 J _> 



WO (12/068694 



PCT/US02/0568J 



3/14 



FIGURE 3 



soooooooo 



t000000000000000©@©0000 



00000000 



0000000000000G90080000 



0-000000000000 



0000-000 



IttMOtOO 

•09Q0000 
M0IM0 



0S!©§§§SM©$O$OGIMIM 



000000003000000^000000 



3 r£ Si S S 

" '2 



• HUM 



I IMM9 



'2 ' 



02068694A1 I > 



WO 02/068694 



PCT/US02/05681 



4/14 



FIGURE 4 



B 



lOObp 



150bp 
lOObp 



PC«2 


PCal9S PCol90 PCfc240 KP186 


HC13S 


D 


H 


O M O M U 


HUH 


O M 






— 




PGal 


PC*4 PC* 666 KP248 




o 


M 


U M O K O 


MUM 


D M 


J 2 * 




fm^mm . «- 


IBB* 





GArnJi 

Cvcthi C 



< 

< 



04 

.2 c 



— C> vO sC 
m 0\ c*V «=t 

— — »n i^, 
C- CU C_ d. 

2: 2 2 



D 



< 
< 



C- 



SAza-dC 
ppENK 



c. — vO %C 
^ C ^ Qu 

1 21 2: z 



BNSDOCID: <WO 02068694A1 J_> 



WO 02/068694 



PCT/US02/0568I 



5/14 



FIGURE 5 



8 

o 



o 52 

E 
c 

<D 
O 

w 
<D 
<D 

E 
jo 
■o 

o 
E 

=3 



o 
o 



8 

o 
o 

o 



8 
8 



* CTJ 

a> 

>> 



BNSDOCID: <WO 02068694A1 J_> 



WO 02/068694 



PCT/US02/05681 



6/14 

FIGURE 6a 

PL3-17, CSX, MINT23 MICP1 (SEQ ID NO:l) 

cccgggcgcccggccctggctcgcggaatgggcggccagatctcaggccctgcgtgcccgagctcagtcccagttccaaccgggg 
gtgcccatggactctcggagggcactcctggggggacagctaagacaccaggctgcaggatcactcattgcacgctgcataatcgcc 
gccacaaactctcccglgcgcaagaacaaacgcgcgtgggacagaaaaagttcctaggtctccgcaggagtgaatgcaaaatccag 
gggactcagggtcatgttgggagccccttctccccccgagagtcagggagctgttgaggtgggatcggtgagggtcgcgccacgcg 
ggtcccttccctaccaggctcggataccatgcagcgtggacactcccgagttgctctgcggaatcccggg 

P18-13 cyclinG micp2 (SEQ ID NO:2) 

Cccggggagggtggttaccgctgaggagctgcagtctctgtcaaggtgagtgggactgcgcgagagttgaccgccagtgcgggtg 
gggagctgggttgggggcgcggggcgaggagtaggtctggcccgcgcccttttccacactaaactctaccgctgttgtgagcacaag 
cccaggctagtccgaggctggaggggcggagcggatccggcctcctgaggtgcctttcgtgtctgccgacccagtcccagggacta 
gcctggggaggaagaatggaacccctgcagttagaggttcctcacatgactagctctgaagacctcctgccttcctgtctttagttggtgt 

gggagggaccttccatgtatccagggcttagcttgtgcccggg 
PL-3-12 MICP3 (298 bp, ECEL-1)(SEQ ID NO:3) 

Cccggggagagctgatcccggacaggggttccgaggaggtaaaggaagctccctggtgaaggtgagctctgcgcaggcgclgtgc 

gggcagcccaggcactccgtcagcgccctctcaatccttaggaggagtcctccctgggtggtgctccagggctgtgggctggaccat 

gacggcccccagctgcgtagctccagctcctttccaaccggttggtgggcggttccgagtggggacacgggatcggaggggactcct 

gccgggtgtacccgctaccctcactagtcccgggatgcg 

P13-9 309bp, micp4 gadl AC007405 (SEQ ID N0:4) 

Cccgggacagagagattgcgggctaatctgggtagatcgaggaccccacagagaagggcccaccggccatcgcctccacaccctc 
cctccgacctcactcctagccccgcgcgcgcagtcgcacagcaactcgggcagcgggtcgactacggcccctggaaaagtaacag 
gttaccgttcctagtagcgccttcggctgctgctgcccaggcgccctttggagggacaggcgctcgcagcatggaagctcaagggaa 
aatcgctcttgccccacttcaactagagcctcagcctccgtgcttccccggg 

PL3-18, 2 nd , ICAM-5 (630 bp, MICP5)(SEQ ID NO:5) 

Cccgggagcatgcgggcacttaccgctgcgaagccaccaaccctcggggctctgcggccaaaaatgtggccgtcacggtggaatg 

tgagtaggggcaccgcggagttaggcaggatctgtgggacaaccccggctggacttcctggcccccgtgtgagcccctgcaatcctg 

tttcccagatggccccaggtttgaggagccgagctgccccagcaattggacatgggtggaaggatctgggcgcctgttttcctgtgagg 

tcgatgggaagccacagccaagcgtgaagtgcgtgggctccgggggcaccactgagggggtgctgctgccgctggcacccccaga 

ccctagtcccagagctcccagaatccctagagtcctggcacccggtatctacgtctgcaacgccaccaaccgccacggctccgtggc 

caaaacagtcgtcgtgagcgcggagtgtgagcgaggcccaggcgggtagggagcagggtgccccacggtccaggcactccctga 

catcccccatggctgttttgcagcgccaccggagatggatgaatctacctgcccaagtcaccagacgtggctggaaggggctgaggct 

tccgcgctggcctgcgccgcccggg 

P18-9 308bp, micp6 AF13261 1 MCT3 (SEQ ID N0:6) 

Cccgggtggtgaccctgccctgcccacaggccccgtgtccagcatcctcgtgacccgctttggctgtcgcccggtgatgctggcggg 
tgggctgctggcttccgcgggcatgatcctagcttcctttgccacgcgccttctggagctctacctgaccgctggggtgctcacaggtga 
gggccccctggtctcctctccgctgggttgggggtcgggggttcttgctgcaagatctgtcctcggtttccctatgagggacagtcttcg 
aagtccctcggctgggttcccggatctgctgggttcccggg 
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FIGURE 6b 

P18-17 548bp, micp7 pax5 (SEQ ID N0:7) 

Cccgggaatccacagccacatttccgattgtggcgaaatctgctcagtggctctgcgatgtccagttgccgccgggggcccccctttcc 
tcgagctctaggctcttccccctaggttgcgacccagcctcgtgaccaccccctccaaaaaaacaaacaacactcttgctgaggacgat 
tcactctccaaaactgcattgtccggcgggccaggagtctctacggacgcgtcccgcctgagacttccctcgagccacccgctgcagg 
cccacggcttcagtggctaggcccagcagctgaaccaactcaaggctgggggggaacaggagggaggcttgagtctggcccgaag 
agagagggctggacatgccacacctctgctcggtctctgtggatctgatttcctctctggaatcaagtcctggggctctgggactccaca 
acgtctcagggctcgagggcaatgcgattccacttacgggccggggtaaggtgtctggaactcccgccaatctccagaaactactgag 
atgttctgctctgcccggg 

P13-50 255bp, micp8 ppENK (SEQ ID NO:8) 

Cccgggaaccgcgaggcgatctgagtcgcctccacgtctacctaaaagctgtcggccgggagggcggggccccagaaaggagca 

ttcctgcgggcttttgctcgacgatcccctgctgaggctgtcgcggcgagggtcctgccgagggaccccgttctgcgcccaggcagg 

ctcgaagcacgcgtccctctctcctcgcagtccatggcgcggttcctgacactttgcacttggctgctgttgctcggccccggg 

P13-3 431 bp, MICP9 (SEQ ID NO:9) 

Cccgggcagagctcggagcgcctccatccccggaaccaggggactccctggagtgctccggtccaggctacgatcgaggcgccc 

ccatcccttgggcccagggagaggatcggagacaccaggaggccctcggggctgggtgaagatctttggttccgggggtccggga 

gaggatccaccctcccaataccccgactcccagggctctgaccaagaatggaggtgccccttctccaggcctcgagccctctgagcg 

ccgaggccggccgcctacaggtcccccgccgctgggcggaccctctcattcggttccctcacgtcacccgctgtccggcgcctggga 

actgggctcctggaatttcctctcctggggctgacagatggccctcttttcctttctctgcggcagcctcgcccatcccggtcccggg 

PL3-14 MICP10 (600bp) (SEQ ID NO: 10) 

Cccggggacggggagggaggagggctgccgggatgtgaaccggggaaggcagctggggctggagagcagcgcggaaaggg 

ggcccagggagctggaaagagggccaagaggagggcaaggaaggtggcgggcgacggggagaggaaagaaaaagggtgtctt 

ggcggtggccttggtaagagaaaggggcaaggggtataattgacaaggcactgaaagtattgaagtcagagccttgggaaggatcta 

ccgaactctcggcggtccacgcggggacagacctcagcccgtgagccttgagctccacgcggggacagacctcagcccgtgagcc 

ttgagctccacgcggggacagacctcagcccgtgagccttgagctccacgcggggacagacctcagcccgtgagccttgagctcca 

cgcggggacagacctcagcccgtgagccttgagctccacgcggggacagacctcagcccgtgagccttgagcccagaaggagtgg 

cagcctcaggacgtttgccaggtggcctggaatgtgagggaagcctcagccccgccaggaacagagctggcgctgagtlcccggct 

cggcccggg 

PL3-23 359 bp, MICP1 1(SEQ ID NO:l 1) 

Cccggggcgcacgggcgtgagggtggggtctcatcgcaggggcgccgggagcctccccgctccgctagctcaaccaaggaccg 

ctcagaggggctctcaccctgaacctcggcttttctaaaggagcgagaccagattcccttctcttctcgacgtcgtttggtgttttctcgttct 

tttcgactgagcacggcaatgcgcagacgtcgacgtctctcactgctcatcccgatctgtaacctgaggtgagcccgaagaccgctgc 

cgccggcggccaccccagcgcggglccgctgaggatggaaacagcaagtgcgcgccggccaggccgccacctctccctcctcca 

acagcccggg 
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PL3-25 370bp, MICP12(SEQ ED NO: 12) 

Cccgggaggtagcgggagattgcacacgcgatcctgcgagtttccgaactttggaagatcgtgacccggagagacccttgggagga 
gagggccggccacctcctaggggtgctgtttttttaagggtcaacccaggacgctacgggaagcacctggcgcatccttggaacagtg 
ggcttggtggcccaccgcaacgcctggcgggaaggaatggcggggcatcgtgtgcctaatggaccccgtcacagacccgccccaa 
tccgaggggcggatgagctcagaggacctgcccaggacgctccttctccactttccaggaaaaccgacggcgtgcgcgcctccgtgt 

cctcgcggagctggggtccccggg 

PL3-32 324bp, MICP13(SEQ ED NO:13) 

Cccgggcctgggctgtgccagcccggcctgccagtctcggcccccattctcgtacggagggatgcggcgcctggggcctcgaagc 
tccgggcggttttggagaagttgaagctcagccgcgatgatatctccacggcggcggggatggtgaaaggggttgtggaccacctgc 
tgctcagactgaagtgcgactccgcgttcagaggcgtcgggctgctgaacaccgggagctactacgagcacgtgaaggtgagctgct 
tggcgccctcccgccgagccccgctgctcggccttccgcaatccgcagtccctaccttccccggg 



P13-49 392bp micp!4(SEQ ID NO:14) 

Cccgggaaagacctcgagagaccttcttcaacacgtctccaggccagattcccctaccatggctcgggagcaatgacgcgctccccc 

tcgccactcgcatggagacccgacttccctggcgccccacgagaggctcactgacctcgccgtcgtacctcgtgagagaccgcacct 

gggccgcgtcgagacacccgagattccccgtcaccgagagcttgaggccttcgtctcctgcatggcctagagaccaatctcgcgacct 

gtctccaaacgccctcaggaggcttgactcccttgagtccgcccagtgagctccaagagatacccgtcgcgattcgagagcagagcg 

gggttctttgcttccactcgagatgaatgcctgtctccccggg 



P18-1 358bp, micplS AP000913.2(SEQ ED NO:15) 

Cccgggagaggcgacagcctcatccgtttatttcctcccttgaccatttgttcagcgactctcccctccgttcagcatccaggttccttacg 
gctacagtgccccagccccgcctcaccagcgcgacattctgccctgcctacccactcagacacagtgcccttttcggttcttcaaacttg 
ctaagcgttttcctatcgatatctgcaggtaacagatggcacgctctcaaatagggtaatcggaggagggtctaataaaggaactattttc 
aacagcggagtaggcgttagggactccagtaggagtagggctgtgtcccaggatactaacagcagggcgccttggccgccccggg 



P18-5 545bp, micpl6(SEQ ID NO: 16) 

Cccgggctgtgcggagatgcgcaggcttggggcggcgttcaggagggactgggaaagggatcgtgccctagggtctcctggtgcg 

aaagggtgcggcgcagcaggtgggatcagggtgaggtccgctggcatttatggggtgggtggtggaaaattggaaagagtttccgg 

ggagttgtggaagactccggaaagaagggtctgttgaaggcggtgtgttgaaaggatgtaggggaatgacagaggggtgttttaggg 

gtccaattgggtgaggtctgggggggaagatatcgagagggtcatgggcccaggagtgcacgtctaggagttgatggggtaggcct 

gagggttcggagaaggtgcggtcggggaggagtctcgcgattgagtttgtcggggcgggcgggaagctgacagctgcctctgtggt 

ctcaggaggcggactccgccggtgcagccgcccccgacgcgggaggaccgggcctcatcgtcccgggagcgctccgcgtcgcga 

ggccgcggcgccgcgcgctcctcatcccggg 
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PL3-16 MICP 17(510 bp, MINT 20) (SEQ ID NO: 17) 

Cccggggggcgctgaggttgccttctcaggcggaggaggcgaaccctgtagcccgacaacgccgggcttcgattttgaggagcttc 

ttccgggatgtcgctatactggccggagacgggccttgcgtgtccattgggggatggcgatggggaccagtttccgggtagagaagg 

agataactcgatacaggaacgcacaggcagcctgaaagcagcccaggatctcgacaggggaagggaagaccctgctccctttgccc 

aaatcctcccgccctgtccttgccttctgtcccagatcccaagcccctcgtacctgcactcgggactcggtcagcccgatacgcagtgc 

cagctcctcacgcataaagatgtcggggtagtgagtcttggcgaagctcctctccaactcgttgagctgtgcgggggtgaagcgcgtc 

cggtggcgcttctgcttctgttggccctgctgctggccggcttggctggggttcgggccgccctggggcccggg 



P13-8 386bp, micpl8 AC005998(SEQ ID NO: 18) 

cccgggaaacagttcaggacgctcaagaccagaagcgggagcaaacccaaaaggagctccaaggaggtgtgtgtggggagagcc 

agggggacgcaggactaggctctttcctgcgcaaggggtggggaaacccgcgaaagccagggagtcgcgcgcactcacgccctc 

gcgccaccaggcagagccaccgctgcaaggagcccacgggtgcgcgctcgctccagggcggatctttccacacccccctcaccct 

caaaagctcaggctggagcggtcatcagtgcggactccggcaccccacccacccagcaggggttaaggagggactggcgcccact 

cttgcctacagctcctgcgcagggctccagccgccaaatcttcccggg 

PL3-33 514bp, micpl9(SEQ ID NO: 19) 

Cccgggctgaatggtagacgttctggcgccgggcagcggccaccggctggttcccacttccgcgcgcaccccttaaactgtgttcta 

gaggccccagcctcgccttgcagcgcctcactagctcctgaggactagggactcggcggtgaggcgggttggcggctcgaacgag 

ctgggcgtcttcgttctctctcgctgcctggctggctcagctggccctccacagctcggagcaaggccatagcagggagtggaggtat 

attgggctgtcacctccttgctggccggagttatttgtagactacagactcggaagaacagacgcgccacgctctcgcttggcattgcttc 

ggatcgcagctcctccttggggtgccccagcttggcgtttatttgcctgcgccaggctctggcnacggtcaccgggccagcggggag 

ggacggacggcaggtgaccagcctctgctgtgaagaaattcctgcgcgcccggagctgtccctaatgcatcccggg 



P13-51 279bp, micp20(SEQ ID NO:20) 

Ccggggatcgggagcttggagtgaggngctcggcgtgacccgtgaggagccccgcgggtagagcggcgctgccccttcgctccg 
gggtgaactgaaactttgctaggggagagggtcggcgccagcctcgcggggttcggagaagacccagcgctgtgcgaggtcgggg 
ccgggcagggcagagcaggggtgaaaggagagacctgtaatgacggcgggatttggggtgcggagggttgcgagggaggggcc 
gcaaccctgaacaattgcattcccggg 

P18-21 422bp, micp21(SEQ ID NO:21) 

Cccgggtctggcttggtctcgcccgcgcagatcccgttcaaactcagctgccaccaagtgcgccttttctctctggattgcgattctgca 
cgaattttccagttgagggtggttcggcgctcagccagcctctgccctcgaagacgcggccttggtctaggaaccttcaggtgggtgttt 
gggcgcagtggccctagttccctagaattcgttttgcctcccgcggcctcagccgcgtggtcagcgagctcgcgagaacgcagctgg 
gcagtcccggacggctctcgggcgcttctagggagcagtcaaagggttgcgaggactgccagggtccttcctccctaggcttcccac 
actggatggccgtaactcagtcgttgacggcgacagccagggccctgatggactgtgcaggtcccggg 
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P13-18-2 mint32, 464bp, micp22(SEQ ED NO:22) 

Cccgggtacctgcacagctcgctccctcccatccttcgggtcttcgctcgaacgtccgctcctcggtgaggccttccctggacaacgca 
tttgaaacgtaaccccaaggcaagaagccaccttccaggcgcgcagccgaagcccagtgccaaggaggccggagactcgggtgcc 
cgcgcatcccgaaaacagcctctgaggggtcctctgagcatccttccagcgtgtttgggaggcaaactcgttgaclagctcttgagagg 
agtggctagaggaatccaggcggggaaggggacggtggactccaggagagtgtaatttacaaaggcggggggcggggacgccca 
ggtccgagtcccaggactctgcgccggacgcttcgcccgccctttcaggtcccctgcccggtcctcgtacccgcgcgggtccggaga 
acctctgagcaccggcccccagcccccggg 

P13-4 257bp, micp 23(SEQ ID NO:23) 

Cccgggatcggtgcccgttagtgggcacaggacgccgggtgtccgaagggctgcctggagatgcaactgaccttggcgggcatca 
gaatgacgagtggcagcagcaggagcggggtgacgaacaagatcacgaaggacttgaacttggagacatagctcagcgccgagg 
.ccatcgcgcgggagggagactggcgggcgagacgagtgaggggcagctagaggcgccgcgggcttaagaaggggccacagtc 

cccggg 

P13-5 350bp, micp24(SEQ ID NO:24) 

Cccggggatggcgcgccaggtaaggcaggcggacgcgcggacccctcggcgcagtgcggccccgaaggcggtagggcgaga 
aagagctgggacccccccgcaacttcgggatcggctgggtgaggctgggcaggcctgatgctcaggggttcgtgccgggaagttca 
ggtcctcggacaacctctctccagctctccgccgccggtacccacggcccagtctccacctggggaaacccccttggcgtggcttgttt 
cgttacaagttatcctggtagagtgggcatgaaggcctcggaggcagtgagtaaatctcatacttctgttcttggtggaatcctggatccc 

ggg 



P13-6 337bp, micp25(SEQ ID NO:25) 

Ccccgggctgccggctggagcccggcagattcgctgcgcagcactgccccctggtatccagcgccgaaagtgcccccctccgcag 
ctgcaaggctcctccctgggctgcgcgggacagattttttctcctttcctggctacacgcctaacagagaagctatcccgagggacctca 
agaagtccccccaagccgtactcaaaagcctttctccctcctcctcaagcgctcacttccccaaagaggacccggacccctgactgcct 
gagccaggtccccagcatggtccgcaacccttctcgactccggcatccacctccaggctgacgtctacccggg 

PL3-1 1 MICP26 (229 bp) (SEQ ID NO:26) 

Cccgggggaaagagatgagtggaaatcgtgggccggacctgcaaggagagactcggcgggcaccttgcttggtctgaggtcgtct 

gcaggaagcggacttttctcctggctcaggatgggaaagacaggggatgcctgaagtcaacggggacttctgttccatctctgccccgt 

tctccaggcccgccagtttttcctgctttggttagattttccaacgtatcccggg 
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PL3-13 MICP27 (427 bp) (SEQ ID NO:27) 

Cccggggctgctgaccgagccccagggcagcccgcccttccccctcctgaagcccgacgtcttcttcatggctccgtctaccggcttc 
ggagacccgctggacttttcgccgcctccggaaccctalatgaggaagcaaatcgcgtccgccacagcctccaactaggaaactccg 
cgactctcagcccctcagaagagaaacggagacgcgccaagcaaagccgttacacggactgtgcacgcgcctccggtgtccctgcg 
cgtgacacaaatttggccccgagggagctccatgtgcctgagtcccaggagccctagatgccagcgacagcttgtcaccaggcctgc 
gacgccaatgggcgggagtcggcggagctcaggacactgacgacgggcctgggggaaagcggtccccacacagcccggg 

PL3-19 (470 bp, MICP28) (SEQ ID NO:28) 

Cccggggacagggggacccccagatgctgcacggctgacaggccaacgtggcagaagctccagcttcacaggaagccagtgacc 

atgagagtctgtagctgtaacgaagccacagagctgtggctttctttccccttcagctctaggaaaggttatctgccctgcacagatctcc 

ggaggcctggctgggctctgagagcatcagactgattatcgtaagaaaataatctctgcagacacattccttgctagaagcaggggaca 

aagcccagcttcaaagacaattccacacacgccctccctgccctgcacagctgcctgccgggtgggagcagagcccttgcagccgg 

gctcaggggcctgggcagggacagcgtgtggcaggggcacagctgagacaggagcctcaaagcgacaccaacccgacgtgaag 

ctacagttgaggagacacagctgcccccattcccggg 

P13-20 (bp 264, MICP29) (SEQ ID NO:29) 

Cccgggcggcggccggattgcgggtgggtgagaggcagcagacgccgtgtttacagctctctcgctagttcgccacclcagccgcg 
gctctagggctgagccagtcgcctccttctttaagattctggtcacagcaggggctgggtttctaaggcaggtttctaaggtgtcttcctac 
agacaccgctgctgctaccttgctaccttcagcgctgggcacagccaggggcagcgcgagagggaggcaacgagagggttcccgg 

g 

P13-21 (bp 371, micp30) (SEQ ID NO:30) 

cccgggaccccaatgccagggaggggcctgcaggaccccagcggtgggcgagttgtgtcctgggtcaccttgtgtttcgcagcgtg 

gcggtggcaggagcccagcgcgggaggacattttcatagcctcctacagtgagaaacgccccccacccgacgctgcgctcatctgtg 

tccccgctgttgccggggctctggtatccacttgcgcgccctatgtggtggggatccacccagagcccagcgtcaagttatacgggcg 

cttcactcagcgtcagccaagaccagggaagcgcttcttgccgtttaggagacgtctgcaagagataaaaagctagcccacgatccac 

ccacaatcctcgtgtccccggg 

PL3-22 bp 1 79, MICP3 1 (SEQ ID NO:3 1 ) 

Cccgggaactcgcggcacccactgggtattgtcgggacccagcaagtctaggaacgggggtgggtagagcatccttcgggcactg 

ccgttcgtccccaaaagaagaccaccgcggggtcccagggccacggcgaggacgggcactggtcagattccggacaggcggtcct 

ggccccggg 
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FIGURE 6g 

PL3-34 445bp, micp32(SEQ ID NO:32) 

Cccgggctgtgaccagcgaattcgggccccgcaggtgcagctgataggagaaatccggctccgggagcgaacccagcggcggaa 

aggcgggctccgcgcccaggcgggccttggactgcaagaaggcgaggatgcgcgcgtacttcgtgtccttggtctcatcgtcacgtg 

tgagtatcgaccaggtcatcatcgcacgtggtaccatagtggaagtagttggcaaactcgctagagtctgctggaggaacgagcccgc 

cgtaggacggacacacctgagtgcccctcccacgcgagcccaaagcgggtgcagggcacctcccaccacatttctggccaaagttc 

ccatttgaggcccgccctctcctctgcgcagtcttagagactggcgaggcacgcgcaaacgccctcttccctgagacctgaccccacc 

cacccacccggg 

P18-2 357bp mic P 33 AC002133(SEQ ID NO:33) 

Cccgggggagccccggaccccgcatcccccagggcgcggaaactggcgaggccccaggagctcccatttatagctcagtttccac 
tgagcgcagtccctctaggacctgggctgagcaagtttcttccactctctcccttccctcctcctcaccccttgcctgcccctcaaccccg 
gcagggcgcaggtgtccaacccagccgggaccccctccctcctcgaacccaggtgttccggctcccagaccccaattgagctgggg 
gcgcccacccgccgggggatcccgccctgcgtcccccattcatccgcgtctcagccgcgggagtttctcaacgggaagagggcgga 

gctcccggg 

P18-3 312bp, micp34 AC005826 (SEO ID NO:34) 

Cccgggaggagagttggggcttgggggacgccgtgaactccatggtccccagcacgcggtcctggccagggacggggtcgtccg 
aactgccgtccagattccccaagggagacaaagacccgaaacacagctcaaagtttccgagagcagtcacagcggggccagggac 
tccagaagtgtcagctccaacgactccagagctgcacactggcctctattccccaccgcaaagccccagagccgcagagacttcgaa 
ggcagccggagaggagagggcccaccgagcactacggcgggtgcgcacgccccggg 

P18-6 372bp, micp35(SEQ ID NO:35) 

Cccgggagccggctcgctggcggcgccaggccacgctctctcattaacatcccgctcccggtggcgcaggggagccggccaaagt 
tcctcgcaaagtggcgagcgaaggagcgctgagcactgacgtctgggctggggaggagcgggtccgagcgaggacggagaggg 
gacagagggaaagggaggcgggtgtcttcctcaggaatttgagctggggatctgcatcctggccattgcagtcctttagcatcctcgcc 
gcgccctgagcgcgctggaggctcgcaggctgcgccctcccagggctgatgccgcgtcctgctccgccgttctgggacgtcgggga 
caaaagtggaggagacgggagagcccggg 

P18-7 399bp, micp36 AF13261 1(SEQ ID NO:36) 

Cccgggctgcgagcgcggctcctggattccagcctcccgcccttcccaggcgctggaatggacacggacgcccacagtggcgggc 

caggtagtgccggagtcgggggcccaggccgcggcgccccgcgcctcatcacttaccttgcctttagctatcaattccatgatgtagcc 

aaattcactcatctccccagactccgacatgtttacaccccttcacaaactctggaggaccgacgcgggtgtatcgaatttgtcctttctttt 

ctctttttctgtttttagtctgagttttgccgagctccccgcccataagctgttaaccaggaaaagaggggaagcgccggggaaagcaag 

aagcgggcttgggtgaaatgaaggccatcgagggctcccggg 
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FIGURE 6h 

PL3-31 307 bp, MICP37(SEQ ID NO:37) 

Cccgggcccggtgcgcaccggtgccgacttggcagccgccctgtgcgctcgacgaaagggtgagaaggaggcaggagtgcagg 
cggaaggagtgggcaatcagcggcggggacgagagtgtgtcttcggggaaaccaagtctgagtgagcgctgaaggggagtgtgcg 
gagcglgccgtgcaccccgagccccccgcctcattgcctctcgcctctctccacctgccccatgatctgcgccagggaccggtcctct 
cccgtccgcaggctgtctaggtggccgttctggtttgctgggacccccggg 

P13-37 331bp, micp38(SEQ ID NO:38) 

Cccggggccacctctgaggcatgaacccagagacgcgcgccctggtctgggaaagcaggaccgctgcgcccagcgcctcaggg 
gtagaggcgggaacaggcccgcggtcgctttgctggcggcggggaagggcgatctgacgatcagggagttgcgcccctctctctgg 
gcctcgtgaaggaacaagagcaattacagcgctgggccggccacgtagtcctggggctaggtgggcccaatgctccgggccgcgg 
ggctggagcgcggaggctggagagggaggaggaccctccgcggctccaggtctcccagctggaggctcacgcccggg 

P13-43 304bp, micp39(SEQ ID NO:39) 

Ccgggcccggtgcgcaccggtgccgacttggcagccgccctgtgcgctcgacgaaagggtgagaaggaggcaggagtgcaggc 
ggaaggagtgggcaatcagcggcgggacgagagtgtgtcttcggggaaaccaagtctgagtgagcgctgaaggggagtgtgcgga 
gccgtgccgtgcaccccgagccccccgcctcattgcctctcgcctctctccacctgccccatgatctgcgccagggagccggtcctct 
cccgtccgcagctgtctaggtggccgttctggtttgctgggccccggg 

P13-44 307bp, micp40(SEQ ID NO:40) 

Ccgggggtcccagcaaaccagaacggccacctagacagcctgcggacgggagaggaccggttccctggcgcagatcatggggc 
aggtggagagaggcgagaggcaatgaggcgggggggctcggggtgcacggcacgctccgcacactcccctccagcgctcactca 
gacttggtttccccgaagacacactctcgtccccgccgcgtgattgcccactccttccgcctgcactccagcctccttctcaccctttcgtc 
gagcgcacaggcggctgccaagtcggcaccggtgcgcaccggcccggg 

MICP41 also known as PL-3-2, 43(SEQ ID NO:41) 

Ccgggcccggtgcgcaccggtgccgacttggcagccgccctgtgcgctcgacgaaagggtgagaaggaggcaggagtgcaggc 
ggaaggagtgggcaatcagcggcgggacgagagtgtgtcttcggggaaaccaagtctgagtgagcgctgaaggggagtgtgcgga 
gccgtgccgtgcaccccgagccccccgcctcattgcctctcgcctctctccacctgccccatgatctgcgccagggagccggtcctct 
cccgtccgcagctgtctaggtggccgttctggtttgctgggccccggg 
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P18-19 479bp, micp42 3' of FLJ00083(SEQ ID NO:42) 

Cccgggttcctggcttgaaccctgtttctccctgttctgccaggcatgctggtccggaaggtgtgtgtngctgtnggctttaggtgggtg 

cagcccgtcccacgtcacggcgagctctgtttcctgggctggggacagtgaggtcatcgctgcccatcctggagctctggctcctttcg 

ggtacctgttccctctcccagagagacccccagctgcatgcaggcctagtgggctccacggcggagctggttcccaggctacctgggt 

tgccacctctgtgggtcccggctgccctctcgcagccgccgctacttcctcaccctcttggccctgcatttccacgtctcatggagccaa 

cgagagcagggggtttgagcccttgtggaaatctggggaggcactgcttctccctccatgtgagcagcttcacccagcctggggtcag 

tgcttacgctccacgcggcctggccttccccggg 
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